Optimized Server for Streamed Applications 
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This application Claims benefit of U.S. Provisional Patent Application Serial 
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BACKGROUND OF THE INVENTION 

TECHNICAL FIELD 

Tlie Invention relates to the streaming of computer program object code 
across a network In a computer environment. More particularly, the 
Invention relates to optimizing server performance for the delivery and 
management of streaming applications to client systems across a computer 
network. 



DESCRIPTION OF THE PRIOR ART 

Retail sales models of computer application programs are fairly straight 
forward. The consumer either purchases the application program from a 
retailer that is either a brick and mortar or an ecommerce entity. The product 
is delivered to the consumer in a shrink-wrap form. 

The consumer installs the program from a floppy disk or a CD-ROM Included 
in the packaging. A serial number is generally provided that must be 
entered at installation or the first time the program is run. Other approaches 



1 



require that the CD-ROM be present whenever the program is run. However, 
CD-ROiVIs are easily copied using common CDR technology. 
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Another approach is for the consumer to effectuate the purchase through an 
5 ecommerce entity. The application program is downloaded in its entirety to 
the consumer across the Internet. The consumer is emailed a serial number 
that is required to run the program. The consumer enters the serial number 
at the time the program is installed or the first time the program Is run. 

10 Once the application program is installed on a machine, it resides on the 



=3J machine, occupying precious hard disk space, until it is physically removed. 

^1 The installer portion of the program can also be installed on a server along 

fll with the installation files. Users within an intranet can install the program 

'T^ from the server, across the network, onto their machines. The program is a 

f f 1 5 full installation of the program and resides on the user's machine until it is 
manually removed. 



^ Trial versions of programs are also available online that are a partial or full 

installation of the application program. The program executes normally for a 
20 preset time period. At the end of the time period, the consumer Is told that he 
must purchase the program and execution is terminated. The drawback to 
this approach is that there is an easy way for the consumer to fool the 
program. The consumer simply uninstalls the program and then reinstalls it, 
thereby restarting the time period. 



25 



30 



Additionally, piracy problems arise once the application program is resident 
on the consumer's computer. Serial numbers for programs are easily 
obtained across the Internet. Software companies lose billions of dollars a 
year in revenue because of this type of piracy. 

The above approaches fail to adequately protect software companies' 
revenue stream. These approaches also require the consumer to install a 
program that resides indefinitely on the consumer's hard disk, occupying 
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valuable space even though the consumer may use the program 
infrequently. 



The enterprise arena allows Application Service Providers (ASP) to provide 
5 browser-based implementations such as Tarantella offered by Santa Cruz 
Operation, Inc. in Santa Cruz, California and Metaframe offered by Citrix 
Systems Inc. of Fort Lauderdale, Florida. A remote application portal site 
allows the user to click on an application in his browser to execute the 
application. The application runs on the portal site and GUI interfaces such 
10 as display, keystrokes and mouse clicks are transferred over the wire. The 
9 access to the program is password protected. This approach allows the 

*=^! provider to create an audit trail and to track the use of an application 

m program. 
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15 AppStream Inc. of Palo Alto, California uses Java code streamlets to provide 
5 streaming applications to the user. The system partitions a Web application 

program into Java streamlets. Java streamlets are then streamed to the 
user's computer on an as-needed basis. The application runs on the user's 
computer, but is accessed through the user's network browser. 

20 

The drawback to the browser-based approaches is that the user is forced to 
work within his network browser, thereby adding another layer of complexity. 
The browser or Java program manages the application program's run-time 
environment. The user loses the experience that the software manufacturer 
25 had originally intended for its product including features such as application 
invocation based on file extension associations. 



It would be advantageous to provide an optimized server for streamed 
applications that optimizes the delivery system of streamed application 
30 program servers. It would further be advantageous to provide an optimized 
server for streamed applications that efficiently manages the streamed 
application programs resident on a streamed application program server. 
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SUMMARY OF THE INVENTION 
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The invention provides an optimized server for streamed applications. The 
5 system enables a server to efficiently stream application programs to a 
plurality of clients across a computer network. In addition, the invention 
provides a system that efficiently manages the storage and retrieval of 
streaming application program source and data files resident on a server. 

10 The invention provides a streamed application server optimized to provide 
efficient delivery of streamed applications to client systems across a 
computer network such as the Internet The server persistently stores 
streamed application program sets that contain streamed application file 
7 pages. Client systems request streamed application file pages from the 

ft 1 5 server using a unique set of numbers common among all servers that store 
C3 the particular streamed application file pages. 

The streamed application file pages are read-only making it easier for the 
server to maintain the streaming application file pages. 

20 

Streamed application file pages are only returned to clients with the proper 
access privileges. Another preferred embodiment of the invention provides 
a license server. The license server offloads the streamed application server 
by performing the client access privilege validations. The license server 
25 provides a client with an access token that contains information regarding 
access rights, the application that it applies to, and an expiration time. 



The streamed application server receives an access token from a requesting 
client, decrypts and validates the access token's contents, and grants access 
30 to the requesting client having a valid access token. The streamed 
application server decrypts access tokens using a secret key shared with the 
license server. Recently approved access tokens are kept track of by the 
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streamed application server and incoming access tokens that match a 
recently approved access token are approved without further processing. 

Commonly accessed streamed application file pages are stored in a cache 
on the streamed application server. The sender attempts to retrieve 
requested streamed application file pages from the cache before retrieving 
them from persistent storage. 

Requested streamed application file pages are compressed before being 
sent to a client. When a client requests multiple streamed application file 
pages, they are concatenated together and the entire set compressed at 
once before sending to the client. The commonly accessed streamed 
application file pages are also compressed before being stored in the cache. 

Another preferred embodiment of the invention compresses all of the 
streamed application file pages In the streamed application sets before 
being stored in the persistent storage. 

Streamed application servers profile the access patterns of streamed 
application file pages. The profiles are used by clients to guide their 
prefetching of streamed application file pages and by the server to pre- 
package and compress groups of streamed application file pages to be sent 
to a client that requests pages within a set and to perform prefetching of 
streamed application file pages for pushing to clients. 

Individual servers across the network can be assigned a specific set of 
streamed applications sets to serve to clients to improve the response time 
for certain applications. Servers can be asymmetrically assigned different 
sets of streamed application sets to Improve overall server efficiency. 
Individual servers can also be dynamically assigned streamed application 
sets to match client accesses over time. 
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A central control server is also provided where individual servers 
periodically send a summary of their file access patterns. The central server 
then generates prefetching information from the aggregate file access 
patterns and updates the individual servers with the prefetching information. 

Other aspects and advantages of the invention will become apparent from 
the following detailed description in combination with the accompanying 
drawings, illustrating, by way of example, the principles of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 is a block schematic diagram of a preferred embodiment of the 
invention showing components on the server that deal with users 
15 subscribing to and running applications according to the invention; 



Fig. 2 is a block schematic diagram of a preferred embodiment of the 
M invention showing the client components supporting application delivery and 

execution according to the invention; 



Fig. 3 is a block schematic diagram of a preferred embodiment of the 
invention showing the components needed to install applications on the 
client according to the invention; 



25 Fig. 4 is a block schematic diagram of the Builder that takes an existing 
application and extracts the Application File Pages for that application 
according to the invention; 

Fig. 5a is a block schematic diagram illustrating how the Client Network 
30 Spoofer is used to handle mapping TCP interfaces to HTTP interfaces 
according to the invention; 
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Fig. 5b is a block schematic diagram illustrating how the Client Network 
Spooler is used to handle mapping TCP interfaces to HTTP interfaces 
according to the invention; 

Fig. 6a is a block schematic diagram showing several different components 
of the client software according to the Invention; 

Fig. 6b is a block schematic diagram showing the use of volatile and non- 
volatile storage of code and data in the client and server according to the 
invention; 

Fig. 7a is a block schematic diagram showing one of two ways in which data 
may be compressed while in transit between the server and client according 
to the Invention; 

Fig. 7b is a block schematic diagram showing the other way in which data 
may be compressed while In transit between the server and client according 
to the Invention; 

Fig. 8 is a block schematic diagram showing an organization of the 
streaming client software according to the invention; 

Fig. 9 Is a block schematic diagram showing an alternative organization of 
the streaming client software according to the invention; 

Fig. 10 Is a block schematic diagram showing the application streaming 
software consisting of a streaming block driver according to the Invention; 

Fig. 11 Is a block schematic diagram showing the application streaming 
software has been divided into a disk driver and a user mode client 
according to the invention; 



Fig. 12 is a block schematic diagram showing the unencrypted and 
encrypted client caches according to the invention; 
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Fig. 13 is a block schematic diagram showing an application generating a 
sequence of code or data requests to the operating system according to the 
Invention; 

Fig. 14 is a block schematic diagram showing server-based prefetching 
according to the invention; 



g Fig. 15 is a block schematic diagram showing a client-to-ciient 

communication mechanism that allows local application customization to 
travel from one client machine to another without involving server 
communication according to the Invention; 



Fig. 16 is a block schematic diagram showing a client cache with extensions 
for supporting local file customization according to the invention; 



Fig. 17 is a block schematic diagram showing aspects of a preferred 
20 embodiment of the Invention related to load balancing and hardware fail 
over according to the invention; 

Fig. 18 is a block schematic diagram showing the benefits to the use of 
compression in the streaming of Application File Pages according to the 
25 invention; 

Fig. 19 is a block schematic diagram showing pre-compression of 
Application File Pages according to the invention; 

30 Fig. 20 is a block schematic diagram showing multi-page compression of 
Application File Pages according to the invention; 
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Fig. 21 is a block schematic diagram showing profile-based prefetching 
according to the invention; 

Fig. 22 is a block schematic diagram showing the use of tokens and a 
5 License Server according to the invention; 



Fig. 23 is a block schematic diagram showing a flowchart for the Builder 
Install Monitor according to the invention; 

Q 10 Fig. 24 is a block schematic diagram showing a flowchart for the Builder 
Application Profiler according to the invention; 
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Fig. 25 is a block schematic diagram showing a flowchart for the Builder 
SAS Packager according to the invention; 

Fig. 26a is a block schematic diagram showing versioning support according 
to the invention; 



Fig, 26b is a block schematic diagram showing versioning support according 
20 to the Invention; 



Fig. 27 is a block schematic diagram showing a data flow diagram for the 
Streamed Application Set Builder according to the invention; 



25 Fig. 28 is a block schematic diagram showing the Streamed Application Set 
format according to the invention; 



Fig. 29 is a block schematic diagram showing an SAS client using a device 
driver paradigm according to the invention; 

30 

Fig. 30 is a block schematic diagram showing an SAS client using a file 
system paradigm according to the invention; 
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Fig. 31a through 31 h is a schematic diagram showing various components 
of the AppinstallBlock format according to the invention; 
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Fig. 32 Is a block schematic diagram showing the Application Install Block 
5 lifecycle according to the invention; 

Fig. 33 is a block schematic diagram showing peer caching according to the 
invention; 

10 Fig. 34 is a block schematic diagram showing proxy caching according to the 
invention; 

Fig. 35 is a block schematic diagram showing multicast within a LAN and a 
packet protocol according to the invention; 
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Fig, 36 is a block schematic diagram showing concurrent requests for 
application server pages, for the case when the page is first requested 



through the proxy according to the invention; 



20 Fig. 37 is a block schematic diagram showing concurrent requests for 
application server pages, for the case when the page is first requested 
through the peer caching according to the invention; 

Fig. 38 is a block schematic diagram showing concurrent requests for 
25 application server pages, for the case when the page is received only 
through peer caching according to the invention; 



Fig. 39 is a block schematic diagram showing a client-server system using 
peer and proxy caching according to the invention; 

30 

Fig. 40 is a block schematic diagram showing a preferred embodiment of the 
invention preventing piracy of remotely served, locally executed applications 
according to the invention; 
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Fig. 41 is a block schematic diagram showing the filtering of accesses to 
remote application files according to the invention; 



5 Fig. 42 is a block schematic diagram showing the filtering of accesses to 
remote files based on process code location according to the invention; 
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Fig. 43 is a block schematic diagram showing the filtering of accesses to 
remote files based on targeted file section according to the invention; 

Fig. 44 is a block schematic diagram showing the filtering of accesses to 
remote files based on surmised purpose according to the invention; and 



Fig. 45 is a block schematic diagram showing the filtering of accesses to 
1 5 remote files based on past access history according to the invention. 



DETAILED DESCRIPTION OF THE INVENTION 



20 The invention is embodied in an optimized server for streamed applications. 
A system according to the invention enables a server to efficiently stream 
application programs to a plurality of clients across a computer network. In 
addition, the invention provides a system that efficiently manages the 
storage and retrieval of streaming application program source and data files 

25 resident on a server. 



The invention provides a highly efficient and secure application delivery 
system in conjunction with the adaptively optimized execution of applications 
across a network such as the Internet, a corporate intranet, or a wide area 
30 network. This is done in such a way that existing applications do not need to 
be recompiled or recoded. Furthermore, the invention is a highly scalable, 
load-balancing, and fault-tolerant system that provides anti-piracy protection 
of the streamed applications. 
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When using the invention, an end-user requests applications that are 
resident on remote systems to be launched and run on the end-user's local 
system. The end-user's local system is called the client or client system, e.g., 
5 a desktop, laptop, palmtop, or information appliance. A remote system is a 
called a server or server system and Is located within a collection of one or 
more servers called a server cluster. 

From the point of view of the client system, the application appears to be 
10 installed locally on the client even though it was initially installed on a 
different computer system. The applications execute locally on the client 
system and not on the server system. To achieve this result, the application 
is converted into a form suitable for streaming over the network. The 
streaming-enabled form of an application is called the Streamed Application 
15 Set (SAS) and the conversion process is termed the SAS Builder. The 
conversion of an application into its SAS form typically takes place on a 
system different from either an end-user client system or an Application 
Service Provider Server Cluster. This system Is called the SAS Conversion 
System or, simply, the conversion system. 

20 

Components of the invention are installed on the client system to support 
activities such as the installation, invocation, and execution of a SAS-based 
application. Other components of the invention are installed on the server 
system to support activities such as the verification of end user application 
25 subscription and license data and the transfer and execution of a SAS- 
based application on the client system. Some of the client and some of the 
server components run in the kernel-mode while other components run in 
the usual user-mode. 

30 The term Application Service Provider (ASP) refers to an entity that uses the 
server components on one or more server systems, /.e., an ASP Server 
Cluster, to deliver applications to end-user client systems. Such an entity 
could be, for example, a software manufacturer, an e-commerce vendor that 
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rents or leases software, or a service department within a company. The 
invention enables an ASP to deliver applications across a network, in a 
highly efficient and secure way; the applications are adaptively optimized for 
execution on an end-user's client system. 

A number of techniques are employed to increase the overall performance of 
the delivery of an application and its subsequent execution by minimizing 
the effect of network latency and bandwidth. Among the techniques 
employed are: the SAS Builder identifies sequences of frequently accessed 
application pages and uses this information when generating a SAS; 
individual SAS pages and sequences of SAS pages are compressed and 
cached in an in-memory cache on the server system; various aspects of the 
applications are monitored during their actual use on a client and the 
resulting profiling data is used by the client to pre-fetch (pull) and by the 
server to send (push) additional pages which have a high likelihood of being 
used prior to their actual use; and SAS pages are cached locally on a client 
for their immediate use when an application is invoked. 

Aggregate profile data for an application, obtained by combining the profile 
data from all the end-user client systems running the application, is used to 
increase the system performance as well. A number of additional caching 
techniques that improve both system scalability and performance are also 
employed. The above techniques are collectively referred to as 
collaborative caching. 

In an embodiment of the invention, the SAS Builder consists of three phases: 
Installation monitoring, execution profiling, and application stream 
packaging. In the final SAS Builder phase, the Application Stream Packager 
takes the information gathered by the Application Install Monitor and the 
Application Execution Profiler and creates the SAS fomn of the application, 
which consists of a Stream Enabled Application Pages File and a Stream 
Enabled Application Install Block. 
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The Stream Enabled Application Install Block is used to install a SAS-based 
application on a client system while selected portions of the Stream Enabled 
Application Pages File are streamed to a client to be run on the client 
system. The Stream Enabled Application Install Block is the first set of data 
to be streamed from the server to the client and contains, among other 
things, the information needed by the client system to prepare for the 
streaming and execution of the particular application. Individual and 
aggregate client dynamic profile data is merged into the existing Stream 
Enabled Application Install Block on tfie server to optimize subsequent 



Q 10 streaming of the application. 



The invention employs a Client Streaming File System that is used to 
manage specific application-related file accesses during the execution of an 
application- For example, there are certain shared library files, e,g,, "foo.dll", 
15 that need to be installed on the local file system, e.g., 
CI "c:\winnt\system32\foo.dir, for the application to execute. Such file names 

get added to a "spoof database". For the previous example, the spoof 
database would contain an entry saying that "c:\winnt\system32\foo.dir is 
mapped to "z:\word\winnt\system32\foo,dir where "z:" implies that this file is 
20 accessed by the Client Streaming File System. The Client Spoofer will then 
redirect all accesses to "c:\winnt\system32\foo.dir to 
"z:\word\winnt\system32\foo.dir. In this manner, the client system gets the 
effect of the file being on the local machine whereas in reality the file is 
streamed from the server. Several different classes of files can be treated in 
25 this way, e.g., specific application registry entries and application-based 
networking calls when such calls cross a firewall. 



Lastly, the Invention incorporates a number of software anti-piracy 
techniques directed at combating the piracy of applications of the type 
30 described herein that are delivered to the end-user over a network for 
execution on a client system. Among the anti-piracy techniques included 
are: client-side fine-grained filtering of file accesses directed at remotely 
served files; filtering of file accesses based on where the code for the 
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process that originated tlie request is stored; identification of crucial portions 
of application files and filtering file access depending on tlie portions of tfie 
application targeted; filtering of file accesses based on the surmised purpose 
of the file access, as determined by examining the program stack or flags 
associated with the request; and filtering of file accesses based on the 
surmised purpose of the file access, as determined by examining a history of 
previous file accesses by the same process. 

As mentioned above, the invention provides server and client technology for 
streaming application delivery and execution. The invention includes secure 
license-based streaming delivery of applications over 
Internet/extranets/intranets utilizing client-based execution with client 
caching and server-based file accesses by page. 

1. The invention provides many advantages over the present approaches, 
including: 

• Secure license-based streaming delivery over 
Internet/extranets/intranets: 

• reduces IT costs over client installation; 

• supports rental model of app delivery, which opens new markets and 
increases user convenience over purchase and client installation; and 

• enhances the opportunities to prevent software piracy over purchase 
and client installation. 

• Client-based execution with client caching: 

• increases typical application performance over server-based 
execution; 

• reduces network latency and bandwidth usage over non-cached 
client execution; and 

• allows existing applications to be run w/o rewrite/recompile/rebuild 
unlike other explicitly-distributed client/server application delivery 
approaches. 

15 
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• Server-based file accesses: 

• improve server-scaling over server-based execution; 

• allow transparent failover to another server whereas server-based 
5 execution does not; 

• make server load balancing easier than it is with server-based 
execution; and 

• allow increased flexibility in server platform selection over server- 
based execution. 
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• Server-based file accesses by page: 

• reduce network latency over complete file downloads; 
%i • reduce network bandwidth overhead over complete file downloads; 

U and 

Pi 15 • reduce client cache footprint over complete file downloads. 



2. Features of the Invention 

A) Server Components Supporting Application Delivery and Execution. 
20 I) referring to Fig. 1 , the server components include: 

a. Client/server network interface 110 that is common to the client 
113 and the server. This is the communication mechanism 
through which the client and the server communicate. 

25 

b. The Subscription Server 105 - This is the server the client 113 
connects to for subscribing and unsubscribing applications. This 
server then adds/deletes the subscription information to the 
Subscription Database 101 and also updates the License 

30 Database 102 with the information stating that the client 113 

can/cannot run the subscribed information under the agreed upon 
licensing terms. This communication between the client 113 and 
the Subscription Server 105 happens over SSL that is an industry 
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standard protocol for secure communication. The Subscription 
Server 105 is also contacted for updating any existing subscription 
information that is in the Subscription Database 101. 

c. The License Server 106 - This is the server the client 113 
connects to for getting a license to run an application after the 
client has subscribed to the application. This server validates the 
user and his subscriptions by consulting the License Database 
102. If the client 113 does have a valid license, the License Server 
106 sends an "Access token" to the client that is encrypted using 
an "encryption key" that the License Database 102 obtains from 
the Server Config Database 103. The "Access token" contains 
Information like the Application ID and an expiration time. Along 
with the "Access token," the License Server 106 also sends a list of 
least loaded application servers that It obtains from the Server 
Config Database 103 and also the expiration time that was 
encoded in the "Access token". The client 113 uses this expiration 
time to know when to ask for a new token. This communication 
between the client 113 and the License Server 106 happens over 
SSL. 

d. The Application Server 107 - Once the client 113 obtains an 
"Access token" to run an application, it connects to the Application 
Server 107 and presents to it the "Access token" along with the 
request for the application bits. Note that the "Access token" is 
opaque to the client 1 1 3 since it does not have the key to decrypt 
it. The Application Server 107 validates the "Access token" by 
decrypting it using a "decryption key" obtained from the Server 
Config Database 103 and checking the content against a 
predefined value like for example the Application ID and also by 
making sure that the expiration time in the "Access token" has not 
elapsed, it then serves the appropriate bits to the client 113 to 
enable It to run the application. The encryption and decryption 
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keys could be something like a private key/public key pair or a 
symmetric key or any other means of providing security. Note that 
the keys are uniform across all the servers within an ASP. 

e. The Monitor Server 108 - It monitors the load in terms of percent of 
CPU utilization on the Application Servers 107 and the License 
Servers 106 on a periodic basis (for example - every minute) and 
adds that information to the Server Config Database 103. 

f. The Profile Server 109 ~ It receives profile information from the 
clients periodically. It adds this information to the Profile Database 
104. The Profile Server 109 based on the profile information from 
different clients updates the App Prefetch Page List section of the 
Stream App Install Blocks 112. 

The data structures supporting the above server components 
include: 

a. Subscription Database 101 - This is the database that 
stores the user information in terms of username, list of apps 
subscribed, password, billing information, address, group, admin. 
The username is the primary key. The admin field identifies if this 
user has admin privileges for the group he belongs to. 

b. License Database 1 02 - This is the database that stores 
licensing information , /.a, which user can run what application 
and under which license. This database also keeps track of usage 
information , /.e,, which user has used which application for how 
long and how many times. The information looks like: 

Username, application, time of usage, number of times run 
Username, application, licensing policy 

Username, application, is app running, no of instances, time of 
start 
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The username is the primary key. The licensing policy could be 
something simple like expiry date or something more complicated 
like number of instances simultaneously allowed within a group 
etc. 

5 

c. Server Config Database 103 - This database stores 

information about which server can run which application, what is 
the load on all the servers, what is the encryption "key" to be used 
by the servers and all other information that Is needed by the 
1 0 servers. The information looks like: 

S Server IP address, App/Slim server, application list, current 

Ifl load 

fU Encryption key, Decryption key 

The Server IP address is the primary key for the first table. The 
f 1 5 keys are common across all servers. 

'i^ d. Profile Database 104 - This database stores the profile 

^ Information received by the profile server from the clients 

periodically. The information looks like: 
20 Application ID, File ID, Block ID number of hits 

The Application ID is the primary key. 

e. Application File Pages 11 1 - This is the one of the outputs 
of the "builder" as explained below and is put on the Application 

25 Server 1 07 so that it can serve the appropriate bits to the client. 

f. Stream App Install Blocks 112 - This is the other output of 
the "builder" and contains the information for successfully installing 
applications on the client for streaming applications. 
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B) Client Components Supporting Application Delivery & Execution 

i) With respect to Figs. 1 and 2, these client components include: 
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a. Client/Server Network interface 202 - This is the same 

interface as explained above. 



b. Client License Manager 205 - This component requests 
5 licenses ("Access tokens") from the License Server 106 when the 

client wants to run applications. The License Server 106 sends an 
"Access token" to the client that can be used to run the applications 
by presenting it to the Application Server 107. Along with the 
token, the License Server 106 also sends the expiry time of the 
□ 10 token. The Client License Manager 205 renews the token just 

g before the expiry period so that the client can continue running the 

^ application. When the application is complete, the Client License 

m Manager 205 releases the token by sending a message to the 

License Server 106. In addition, when a user has subscribed to an 
^ 1 5 application, the Client License Manager 205 first checks to make 

sure that the application is installed on the machine the user is 
f^j trying to run the application from and if not requests for the 

application installation. It does this using a list of Installed Apps 
that it maintains. 

20 

c. Client Cache Manager 207 - This component caches the 
application bits received from the Application Server 107 so that 
next time a request is made to the same bits, the request can be 
served by the cache instead of having to go to the Application 

25 Server 107. The Client Cache Manager 207 has a limited amount 

of space on the disk of the client machine that it uses for the cache. 
When the space Is fully occupied, the Client Cache Manager 207 
uses a policy to replace existing portions of the cache. This policy 
can be something like LRU, FIFO, random etc. The Client Cache 
30 Manager 207 is responsible for getting the application bits 

requested by the Client Streaming File System 212. If it does not 
have the bits cached, It gets them from the Application Server 107 
through the network interface. However it also need to get the 
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"Access token" from the Client License Manager 205 that it needs 
to send along with the request for the application bits. The Client 
Cache Manager 207 also updates the Prefetch History Info 209 
with the requests it receives from the Client Streaming File System 
5 212. 

d. Client Streaming File System 212 - This component serves 

all file system requests made by the application running on the 
client. The application makes calls like "read", "write" etc. to files 
1 0 that need to be streamed. These requests lead to page faults in the 

q operating system and the page faults are handled by the Client 

^ Streaming File System 212 that in turn asks the Client Cache 

III Manager 207 for the appropriate bits. The Client Cache Manager 

J'' 207 will send those bits from the cache if they exist there or 

15 forward the request to the Application Server 107 through the 

network interface to get the appropriate bits. 



e. Client Prefetcher 208 ~ This component monitors the 
requests made by the client to the Application Server 107 and 

20 uses heuristics to make additional requests to the Application 

Server 107 so that the bits can be obtained from the Application 
Server 107 before the client machine makes the request for them. 
This is mainly to hide the latency between the client and the 
Application Server 107. The history information of the requests is 

25 stored in the Prefetch History Info file 209. 

f. Client Profiler 203 - At specific time intervals, the client 
profiler sends the profile information, which is the Prefetch History 
Info to the prefetch server at the ASP that can then update the App 

30 Prefetch Page Lists for the different applications accordingly. 

g. Client File Spoofer 21 1 - Certain files on the client need to 
be installed at specific locations on the client system. To be able to 
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stream these files from the Application Server 107, the Client 
Spoofer 211 intercepts all requests to these files made by a 
running application and redirects them to the Client Streaming File 
System 212 so that the bits can be streamed from the Application 
Server 107. 

h. Client Registry Spoofer 213 - Similar to files, certain 
registry entries need to be different when the application being 
streamed is running and since it is undesirable to overwrite the 
existing registry value, the read of the registry value is redirected to 
the Client Registry Spoofer 215 which returns the right value. 
However, this is optional as it is very likely that overwriting the 
existing registry value will make the system work just fine. 

i. Client Network Spoofer 213 - Certain applications make 
networking calls through a protocol like TCP. To make these 
applications work across firewalls, these networking calls need to 
be redirected to the Client Network Spoofer 213 which can tunnel 
these requests through a protocol like HTTP that works through 
firewalls. 

The data structures needed to support the above client 
components include: 

a. File Spoof Database 210 - The list of files the requests to 
which need to be redirected to the Client Streaming File System 
212. This Information looks like (The source file name is the 
primary key) 

Source File Name, Target File Name 

b. Registry Spoof Database 216 - List of registry entries and 
their corresponding values that need to be spoofed. Each entry 
looks like: 
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Registry entry, new value 



c. Network Spoof Database 214 - Like of IP addresses, the 
networking connections to which need to be redirected to the 
Client Network Spoofer 213. Each entry looks like (IP address Is 
the primary key): 

IP address, Port number, new IP address, new Port number 

d. Client Stream Cache 206 - The on-disk cache that 
persistently stores application bits. 

e. Known ASPs and Installed Apps 204 - The list of ASP 
servers (Application, License and Subscription) and also the list of 
applications that are installed on the client. 

f. Prefetch History Info 209 - The history of the requests made 
to the cache. This consists of which blocks were requested from 
which file for which application and how many times each block 
was requested. It also consists of predecessor-successor 
information indicating which block got requested after a particular 
block was requested. 

C) Client Application Installation 

25 Referring to Fig. 3, the client application installation components include: 

i) Client License Manager 303 - This is the same component 
explained above. 

30 ii) Client Application Installer 305 - This component is invoked when 
the application needs to be Installed. The Client Application Installer 
305 sends a specific request to the Application Server 107 for getting 
the Stream App Install Block 301 for the particular application that 
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needs to be installed. The Stream App Install Block 301 consists of 
the App Prefetch Page List 306, Spoof Database 308, 309, 310, and 
App Registry Info 307. The Client Application Installer 305 then 
updates the various Spoof Databases 308, 309, 310 and the Registry 
5 307 with this information, it also asks the Client Prefetcher 208 to start 

fetching pages in the App Prefetch Page List 306 from the Application 
Server 107. These are the pages that are known to be needed by a 
majority of the users when they run this application. 

10 D) Application Stream Builder Input/Output 

^-1 With respect to Fig. 4, the Builder components include the following: 

i) Application Install Monitor 403 - This component monitors the 
15 installation of an application 401 and figures out all the files that have 

[3 been created during installation 402, registry entries that were created 

;ij and all the other changes made to the system during installation. 

ii) Application Profiler 407 - After the application is installed, it is 
20 executed using a sample script. The Application Profiler 407 monitors 

the application execution 408 and figures out the application pages 
that got referenced during the execution. 

iii) App Stream Packager 404 ~ The App Stream Packager 404 takes 
25 the information gathered by the Application Profiler 407 and the 

Application Install Monitor 403 and forms the Application File Pages 
406 and the Stream App Install Block 405 from that information. 

E) Network Spoofing for client-server applications: 
30 Referring to Figs. 1 , 4, 5a, 5b, and 6a, the component that does the 

Network Spoofing is the TCP to HTTP converter 503, 507. The 
basic idea is to take TCP packets and tunnel them through HTTP 
on one side and do exactly the opposite on the other. As far as the 
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client 501 and the server 502 are concerned the communication is 
TCP and so existing applications that run with that assumption 
work unmodified. This is explained in more detail below. 



5 On the client side, the user launches an application that resides on the Client 
Streaming File System. That application may be started In the same ways 
that applications on other client file systems may be started, e.g., opening a 
data file associated with the application or selecting the application from the 
Start/Programs menu in a Windows system. From the point of view of the 
10 client's operating system and from the point of view of the application itself, 



g| that application is located locally on the client. 

M Whenever a page fault occurs on behalf of any application file residing on 

iff- 

.r the Client Streaming File System 604, that file system requests the page 

ft 1 5 from the Client Cache Manager 606. The Client Cache Manager 606, after 
Q ensuring via Interaction with the Client License Manager 608 that the user's 

Q client system holds a license to run the application at the current time, 

checks the Client Stream Cache 611 and satisfies the page fault from that 
cache, if possible. If the page is not currently in the Client Stream Cache 
20 611, the Client Cache Manager 606 makes a request to the Client/Server 
Network Interface 505, 609 to obtain that page from the Application File 
Pages stored on an Application Server 506. 

The Client Prefetcher 606 tracks all page requests passed to the Client 
25 Cache Manager 606. Based on the pattern of those requests and on 
program locality or program history, the Client Prefetcher 606 asks the Client 
Cache Manager 606 to send additional requests to the Client/Server 
Network Interface 505, 609 to obtain other pages from the Application File 
Pages stored on the Application Server 506. 
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Files located on the Client Streaming File System 604 are typically identified 
by a particular prefix (like drive letter or pathname). However, some files 
whose names would normally imply that they reside locally are mapped to 
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the Client Streaming File System 604, in order to lower the invention's 
impact on the user's local configuration. For instance, there are certain 
shared library files (dll's) that need to be installed on the local file system 
(c:\winnt\system32\foo.dll). It is undesirable to add that file on the user's 
5 system. The file name gets added to a "spoof database" which contains an 
entry saying that c:\winnt\system32\foo.dll is mapped to 
z:\word\winnt\system32\foo.dll where z: Implies that it is the Client Streaming 
File System. The Client Spoofer 603 will then redirect all accesses to 
c:\wlnnt\system32\foo.dll to z:\word\winnt\system32\foo.dll. In this manner the 
1 0 client system gets the effect of the file being on the local machine whereas in 
reality the file is streamed from the server. 



m In a similar fashion the Client Spoofer 603 may also be used to handle 

mapping TCP interfaces to HTTP interfaces. There are certain client-server 
1^ 1 5 applications (like ERP/CRM applications) that have a component running on 
a client and another component running on a database server, Web server 
etc. These components talk to each other through TCP connections. The 
client application will make TCP connections to the appropriate server (for 
this example, a database server) when the client piece of this application is 
20 being streamed on a user's machine. 

The database server could be resident behind a firewall and the only way for 
the client and the server to communicate is through a protocol like HTTP that 
can pass through firewalls. To enable the client to communicate with the 

25 database server, the client's TCP requests need to be converted to HTTP 
and sent to the database server. Those requests can be converted back to 
TCP so that the database server can appropriately process the requests just 
before the requests reach the database server. The Client Spoofer's 603 
responsibility in this case is to trap all TCP requests going to the database 

30 server and convert it into HTTP requests and take all HTTP requests coming 
from the database server and convert them into TCP packets. Note that the 
TCP to HTTP converters 505, 507 convert TCP traffic to HTTP and vice versa 
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by embedding TCP packets within the HTTP protocol and by extracting the 
TCP packets from the HTTP traffic. This is called tunneling. 



When the Client License Manager 608 is asked about a client's status with 
5 respect to holding a license for a particular application and the license is not 
already being held, the Client License Manager 608 contacts the License 
Server 106 via the Client/Server Network Interface 609 and asks that the 
client machine be given the license. The License Server 106 checks the 
Subscription 101 and License 102 Databases and, if the user has the right to 
10 hold the license at the current time, it sends back an Access Token, which 
O represents the right to use the license. This Access Token is renewed by the 

client on a periodic basis. 

- The user sets up and updates his information in the Subscription 101 and 

Q 15 License 102 Databases via interacting with the Subscription Server 105. 

m 



Whenever a user changes his subscription Information, the Subscription 



Q Server 105 signals the user's client system since the client's Known ASPs 

^ and Installed Apps information potentially needs updating. The client system 

also checks the Subscription 101 and License 102 Databases whenever the 
20 user logs into any of his client systems set up for Streaming Application 
Delivery and Execution. If the user's subscription list in the Subscription 101 
and License 102 Databases list applications that have not been installed on 
the user's client system, the user is given the opportunity to choose to install 
those applications. 

25 

Whenever the user chooses to install an application, the Client License 
Manager 608 passes the request to the Client Application Installer 607 along 
with the name of the Stream App Install Block to be obtained from the 
Application Server 107. The Client Application Installer 607 opens and 
30 reads that file (which engages the Client Streaming File System) and 
updates the Client system appropriately, including setting up the spoof 
database, downloading certain needed non-application-specific files, 
modifying the registry file, and optionally providing a list of applications 



27 



pages to be prefetched to warm up the Client Stream Cache 611 with 
respect to the application. 



The Application Stream Builder creates the Stream App Install Block 405 
5 used to set up a client system for Streaming Application Delivery and 
Execution and it also creates the set of Application File Pages 406 sent to 
satisfy client requests by the Application Server 107. The process that 
creates this information is offline and involves three components. The 
Application Install Monitor 403 watches a normal installation of the 
10 application and records various information including registry entries, 
required system configuration, file placement, and user options. The 
Application Profiler 407 watches a normal execution of the application and 
records referenced pages, which may be requested to pre-warm the client's 
cache on behalf of this application. The Application Stream Packager 404 
15 takes information from the other two Builder components, plus some 
d Information it compiles with respect to the layout of the Installed application 

and forms the App Install Block 405 and the set of Application File Pages 
406. 
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20 Server fail-over and server quality of service problems are handled by the 
client via observation and information provided by the server components. 
An ASP'S Subscription Server provides a list of License Servers associated 
with that ASP to the client, when the user initiates/modifies his account or 
when the client software explicitly requests a new list. A License Server 

25 provides a list of Application Servers associated with an application to the 
client, whenever it sends the client an Access Token for the application. 

Should the client observe apparent non-response or slow response from an 
Application Server, it switches to another Application Server in its list for the 
30 application in question. If none of the Application Servers in its list respond 
adequately, the client requests a new set for the application from a License 
Server, The strategy is similar in the case in which the client observes 
apparent non-response or slow response from a License Server; the client 
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switches to another License Server in its list for the ASP in question. If none 
of the License Servers in its list responds adequately, the client requests a 
new set of License Servers from the ASP. 



un 
si 



5 Server load balancing is handled by the server components In cooperation 
with the client. A server monitor component tracks the overall health and 
responsiveness of all servers. When a server is composing one of the server 
lists mentioned in the previous paragraph, it selects a set that is alive and 
relatively more lightly used than others. Client cooperation is marked by the 
10 client using the server lists provided by the servers in the expected way, and 
not unilaterally doing something unexpected, like continuing to use a server 
which does not appear in the most recent list provided. 

Security Issues associated with the server client relationship are considered 
ft 15 in the invention. To ensure that the communication between servers and 
O clients is private and that the servers in question are authorized via 

appropriate certification, an SSL layer is used. To ensure that the clients are 
licensed to use a requested application, user credentials 
(username+password) are presented to a License Server, which validates 
20 the user and his licensing status with respect to the application in question 
and issues an Access Token, and that Access Token is in turn presented to 
an Application Server, which verifies that the Token's validity before 
delivering the requested page. Protecting the application in question from 
piracy on the client's system is discussed in another section, below. 
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CLIENT-SIDE PERFORMANCE OPTIMIZATION 



This section focuses on client-specific portions of the invention. The 
invention may be applied to any operating system that provides a file system 
30 interface or block driver interface. A preferred embodiment of the Invention 
is Windows 2000 compliant. 
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With respect to Fig. 6a, several different components of the client software 
are shown. Some components will typically run as part of the operating 
system kernel, and other portions will run in user mode. 

5 The basis of the client side of the streamed application delivery and 
execution system Is a mechanism for making applications appear as though 
they were installed on the client computer system without actually installing 
them. 

Installed applications are stored in the file system of the client system as files 
organized in directories. In the state of the art, there are two types of file 
systems: local and network. Local file systems are stored entirely on media 
(disks) physically resident in the client machine. Network file systems are 
stored on a machine physically separate from the client, and all requests for 
data are satisfied by getting the data from the server. Network file systems 
are typically slower than local file systems. A traditional approach to use the 
better performance of a local file system Is to Install important applications on 
the local file system, thereby copying the entire application to the local disk. 
The disadvantages of this approach are numerous. Large applications may 
take a significant amount of time to download, especially across slower wide 
area networks. Upgrading applications is also more difficult, since each 
client machine must individually be upgraded. 

The invention eliminates these two problems by providing a new type of file 
25 system: a streaming file system. The streaming file system allows 
applications to be run immediately by retrieving application file contents from 
the server as they are needed, not as the application Is installed. This 
removes the download cost penalty of doing local installations of the 
application. The streaming file system also contains performance 
30 enhancements that make it superior to running applications directly from a 
network file system. The streaming file system caches file system contents 
on the local machine. File system accesses that hit in the cache are neariy 
as fast as those to a local file system. The streaming file system also has 
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30 



sophisticated information about application file access patterns. By using 
this knowledge, the streaming file system can request portions of application 
files from the server in advance of when they will actually be needed, thus 
further Improving the performance of applications running on the application 
5 streaming file system. 
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In a preferred embodiment of the invention, the application streaming file 
system is implemented on the client using a file system driver and a helper 
application running in user mode. The file system driver receives all 



Q 10 requests from the operating system for files belonging to the application 
Q streaming file system. The requests it handles are all of the standard file 

"f: system requests that every file system must handle, including (but not limited 

to) opening and closing files, reading and writing files, renaming files, and 
deleting files. Each file has a unique identifier consisting of an application 
15 number, and a file number within that application. In one embodiment of the 
invention, the application number Is 128 bits and the file number Is 32 bits, 
Q resulting in a unique file ID that is 160 bits long. The file system driver is 

responsible for converting path names (such as "z:\program files\foo.exe") 
into file IDs (this Is described below). Once the file system driver has made 
20 this translation, it basically forwards the request to the user-mode program to 
handle. 



The user-mode program is responsible for managing the cache of 
application file contents on the local file system and contacting the 

25 application streaming server for file contents that it cannot satisfy out of the 
local cache. For each file system request, such as read or open, the user- 
mode process will check to see if it has the requested information in the 
cache. If it does, it can copy the data from the cache and return It to the file 
system driver. If it does not, it contacts the application streaming server over 

30 the network and obtains the information it needs. To obtain the contents of 
the file, the user-mode process sends the file identifier for the file it is 
interested in reading along with an offset at which to read and the number of 
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bytes to read. The application streaming server will send back the requested 
data. 
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The file system can be implemented using a fragmented functionality to 
facilitate development and debugging. AH of the functionality of the user- 
mode component can be put into the file system driver itself without 
significantly changing the scope of the invention. Such an approach is 
believed to be preferred for a client running Windows 95 as the operating 
system. 



Directories are specially formatted files. The file system driver reads these 
from the user mode process just like any other files with reads and writes. 
Along with a header containing information about the directory (such as how 
€l long It is), the directory contains one entry for each file that it contains. Each 

15 entry contains the name of the file and its file identifier. The file identifier is 
5^ necessary so that the specified file can be opened, read, or written. Note 

2 that since directories are files, directories may recursively contain other 

u directories. All files in an application streaming file system are eventual 

descendents of a special directory called the "root". The root directory is 
20 used as the starting point for parsing file names. 



Given a name like "zr/foo/bar/baz", the file system driver must translate the 
path "z:/foo/bar/baz" into a fife identifier that can be used to read the file from 
the application streaming service. First, the drive letter is stripped off, leaving 

25 7foo/bar/baz". The root directory will be searched for the first part of the path, 
in this case "foo". If the file "foo" is found in the root directory, and the file 
"foo" is a directory, then "foo" will be searched for the next portion of the path, 
"bar". The file system driver achieves this by using the file id for "foo" (found 
by searching the root directory) to open the file and read its contents. The 

30 entries inside "foo" are then searched for "bar"*, and this process continues 
until the entire path is parsed, or an error occurs. 
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In the following examples and text, the root directory is local and private to 
the client. Each application that is Installed will have its own special 
subdirectory in the root directory. This subdirectory will be the root of the 
application. Each application has its own root directory. 

5 

The invention's approach is much more efficient than other approaches like 
the standard NFS approach. In those cases, the client sends the entire path 
7foo/bar/baz" to the server and the server returns the file id for that file. The 
next time there is a request for 7foo/bar/baz2" the entire path again needs to 
10 be sent. In the approach described here, once the request for "baf was 
made, the file ids for all files within bar are sent back including the ids for 
"baz" and "baz2" and hence "baz2" will already be known to client. This 
M reduces communication between the client and the server. 

1 5 In addition, this structure also allows applications to be easily updated. If 
O certain code segments need to be updated, then the code segment listing in 

g the application root directory is simply changed and the new code segment 

subdirectory added. This results in the new and correct code segment 
subdirectory being read when it is referenced. For example if a file by the 
20 name of 7foo/bar/baz3" needs to be added, the root directory is simply 
changed to point to a new version of '*foo" and that new version of "foo" 
points to a new version of "bar" which contains "baz3" in addition to the files 
it already contained. However the rest of the system is unchanged. 



25 Client Features 



Referring to Figs. 6a and 6b, a key aspect of the preferred embodiment of the 
Invention is that application code and data are cached In the client's 
persistent storage 616, 620. This caching provides better performance for 
30 the client, as accessing code and data in the client's persistent storage 620 
is typically much faster than accessing that data across a wide area network. 
This caching also reduces the load on the server, since the client need not 
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retrieve code or data from the application server that it already has in its local 
persistent storage. 



In order to run an application, its code and data must be present in the client 
5 system*s volatile storage 619. The client software maintains a cache of 
application code and data that normally reside in the client system*s 
nonvolatile memory 620. When the running application requires data that is 
not present in volatile storage 619, the client streaming software 604 Is 
asked for the necessary code or data. The client software first checks its 
^ 10 cache 611, 620 in nonvolatile storage for the requested code or data. If it is 
O found there, the code or data are copied from the cache in nonvolatile 

storage 620 to volatile memory 619. If the requested code or data are not 
found in the nonvolatile cache 611, 620, the client streaming software 604 
will acquire the code or data from the server system via the client's network 



1 5 interface 621 , 622 



Application code and data may be compressed 623, 624 on the server to 
S provide better client performance over slow networks. Network file systems 

typically do not compress the data they send, as they are optimized to 
20 operate over local area networks. 



Figs. 7a & 7b demonstrate two ways in which data may be compressed while 
in transit between the server and client. With either mechanism, the client 
may request multiple pieces of code and data from multiple files at once. 

25 Fig. 7A illustrates the server 701 compressing the concatenation of A, B, C, 
and D 703 and sending this to the client 702. Fig. 7B illustrates the server 
706 separately compressing A, B, C, and D 708 and sending the 
concatenation of these compressed regions to the client 707. In either case, 
the client 702, 707 will decompress the blocks to retrieve the original 

30 contents A, B, C, and D 704, 709 and these contents will be stored in the 
cache 705, 710. 
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The boxes marked "Compression" represent any method of making data 
more compact, including software algorithms and hardware. The boxes 
marked "Decompression" represent any method for expanding the 
compacted data, including software algorithms and hardware. The 
5 decompression algorithm used must correspond to the compression 
algorithm used. 



The mechanism for streaming of application code and data may be a file 
system. Many network fife systems exist. Some are used to provide access 
g 10 to applications, but such systems typically operate well over a local area 
network (LAN) but perform poorly over a wide area network (WAN). While 
this solution Involves a file system driver as part of the client streaming 
f -y software, it is more of an application delivery mechanism than an actual file 

system. 
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With respect to Fig. 8, application code and data are installed onto the file 
system 802, 805, 806, 807 of a client machine, but they are executed from 
the volatile storage (main memory). This approach to streamed application 
delivery involves installing a special application streaming file system 803, 

20 804. To the client machine, the streaming file system 803, 804 appears to 
contain the installed application 801 . The application streaming file system 
803 will receive all requests for code or data that are part of the application 
801. This file system 803 will satisfy requests for application code or data by 
retrieving it from its special cache stored in a native file system or by 

25 retrieving it directly from the streaming application server 802. Code or data 
retrieved from the server 802 will be placed In the cache in case It Is used 
again. 



Referring to Fig. 9, an alternative organization of the streaming client 
30 software is shown. The client software is divided into the kernel-mode 
streaming file system driver 905 and a user-mode client 902. Requests 
made to the streaming file system driver 905 are all directed to the user- 
mode client 902, which handles the streams from the application streaming 
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server 903 and sends, the results back to the driver 905. The advantage of 
this approach is that it is easier to develop and debug connpared with the 
pure-kernel mode approach. The disadvantage is that the performance will 
be worse than that of a kernel-only approach. 

5 

As shown in Figs. 10 and 11, the mechanism for streaming of application 
code and data may be a block driver 1004, 1106. This approach is an 
alternative to that represented by Figs. 8 and 9. 

!f: 10 With respect to Fig. 10, the application streaming software consists of a 

t ;i streaming block driver 1004. This block driver 1004 provides the abstraction 

{/J of a physical disk to a native file system 1003 already installed on the client 

^ operating system 1002. The driver 1004 receives requests for physical block 

j;i reads and writes, which it satisfies out of a cache on a standard file system 

^ 15 1003 that IS backed by a physical disk drive 1006, 1007. Requests that 
cannot be satisfied by the cache go to the streaming application server 1005, 

U 

ffi as before. 

O 

SB*? 

Referring to Fig. 1 1 , the application streaming software has been divided into 
20 a disk driver 1 106 and a user mode client 1 102. In a manner similar to that 
of Fig. 9, the disk driver 1106 sends all requests it gets to the user-mode 
client 1 102, which satisfies them out of the cache 1 107, 1 108 or by going to 
the application streaming server 1 1 03. 

25 The persistent cache may be encrypted with a key not permanently stored on 
the client to prevent unauthorized use or duplication of application code or 
data. Traditional network file systems do not protect against the 
unauthorized use or duplication of file system data. 

30 With respect to Fig. 12, unencrypted and encrypted client caches. A, B, C, 
and D 1201 representing blocks of application code and data in their natural 
form are shown. E,^(X) represents the encryption of block X with key k 1202. 
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Any encryption algorithm may be used. The key k is sent to the client upon 
application startup, and it is not stored in the application's persistent storage. 



Client-initiated prefetching of application code and data helps to improve 
5 interactive application performance. Traditional network file systems have 
no prefetching or simple locality based prefetching. 

Referring to Fig. 13, the application 1301 generates a sequence of code or 
data requests 1302 to the operating system(OS) 1303. The OS 1303 directs 
Q 10 these 1304 to the client application streaming software 1305. The client 
O software 1305 will fetch the code or data 1306 for any requests that do not hit 

in the cache from the server 1307, via the network. The client software 1305 
f|f inspects these requests and consults the contents of the cache 1309 as well 

as historic information about application fetching patterns 1308. It will use 
15 this information to request additional blocks of code and data that it expects 
will be needed soon. This mechanism is referred to as "pull prefetching." 

Server-initiated prefetching of application code and data helps to improve 
interactive application performance. Traditional network file systems have 
20 no prefetching or simple locality based prefetching. 

With respect to Fig. 14, the server-based prefetching is shown. As in Fig. 13, 
the client application streaming software 1405 makes requests for blocks 
1407 from the application streaming server 1408. The server 1408 
25 examines the patterns of requests made by this client and selectively returns 
to the client additional blocks 1406 that the client did not request but is likely 
to need soon. This mechanism is referred to as "push prefetching." 

A client-to-client communication mechanism allows local application 
30 customization to travel from one client machine to another without involving 
server communication. Some operating systems have a mechanism for 
copying a user's configuration and setup to another machine. However, this 
mechanism typically doesn't work outside of a single organization's network, 
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and usually will copy the entire environment, even if only the settings for a 
single application are desired. 

Referring to Fig. 15, a client-to-client mechanism is demonstrated. When a 
user wishes to run an application on a second machine, but wishes to retain 
customizations made previously on the first, the client software will handle 
this by contacting the first machine to retrieve customized files and other 
customization data. Unmodified files will be retrieved as usual from the 
application streaming server. 

Here, File 4 exists in three different versions. The server 1503 provides one 
version of this file 1506, client 1 1501 has a second version of this file 1504, 
and client 2 1502 has a third version 1505. Files may be modified differently 
for each client. 

The clients may also contain files not present on the server or on other 
clients. File 5 1507 Is one such file; it exists only on client 1 1501. File 6 
1508 only exists on client 2 1502. 

Local Customization 

A local copy-on-write file system allows some applications to write 
configuration or initialization files where they want to without rewriting the 
application, and without disturbing the local customization of other clients. 
Installations of applications on file servers typically do not allow the 
installation directories of applications to be written, so additional 
reconfiguration or rewrites of applications are usually necessary to allow 
per-user customization of some settings. 

With respect to Fig. 16, the cache 1602 with extensions for supporting local 
file customization is shown. Each block of data in the cache is marked as 
"clean" 1604 or "dirty" 1605. Pages marked as dirty have been customized 
by the client 1609, and cannot be removed from the cache 1602 without 
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losing client customization. Pages marked as clean may be purged from the 
cache 1602, as they can be retrieved again from the server 1603. The index 
1601 indicates which pages are clean and dirty. In Fig. 16, clean pages are 
white, and dirty pages are shaded. File 1 1606 contains only clean pages, 
and thus may be entirely evicted from the cache 1602. File 2 1607 contains 
only dirty pages, and cannot be removed at all from the cache 1602. File 3 
1608 contains some clean and some dirty pages 1602. The clean pages of 
File 3 1608 may be removed from the cache 1602, while the dirty pages 
must remain. 

Selective Write Protection 



The client streaming software disallows modifications to certain application 
files. This provides several benefits, such as preventing virus infections and 

15 reducing the chance of accidental application corruption. Locally installed 
files are typically not protected in any way other than conventional backup. 
Application file servers may be protected against writing by client machines, 
but are not typically protected against viruses running on the server itself. 
Most client file systems allow flies to be marked as read-only, but it is 

20 typically possible to change a file from read-only to read-write. The client 
application streaming software will not allow any data to be written to files 
that are marked as not modifiable. Attempts to mark the file as writeable will 
not be successful. 

25 Error Detection and Correction 

The client streaming software maintains checksums of application code and 
data and can repair damaged or deleted flies by retrieving another copy from 
the application streaming server. Traditional application delivery 
30 mechanisms do not make any provisions for detecting or correcting 
corrupted application installs. The user typically detects a corrupt 
application, and the only solution is to completely reinstall the application. 
Corrupt application files are detected by the invention automatically, and 
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replacement code or data are invisibly retrieved by the client streaming 
software without user Intervention. 



If! 



When a block of code or data is requested by the client operating system, the 
5 client application streaming software will compute the checksum of the data 
block before it is returned to the operating system. If this checksum does not 
match that stored in the cache, the client will invalidate the cache entry and 
retrieve a fresh copy of the page from the server. 

1 0 File Identifiers 

Applications may be patched or upgraded via a change in the root directory 
for that application. Application files that are not affected by the patch or 
upgrade need not be downloaded again. Most existing file systems do not 
15 cache files locally. 



O Each file has a unique identifier (number). Files that are changed or added 

in the upgrade are given new identifiers never before used for this 
application. Files that are unchanged keep the same number. Directories 
20 whose contents change are also considered changes. If any file changes, 
this will cause its parent to change, all the way up to the root directory. 



Upgrade Mechanism 



25 When the client is informed of an upgrade, it is told of the new root directory. 
It uses this new root directory to search for files in the application. When 
retrieving an old file that hasn't changed, it will find the old file identifier, 
which can be used for the existing files in the cache. In this way, files that do 
not change can be reused from the cache without downloading them again. 

30 For a file that has changed, when the file name is parsed, the client will find a 
new file number. Because this file number did not exist before the upgrade, 
the client wilt not have this file in the cache, and will stream the new file 
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contents when the file is freshly accessed. This way it always gets the 
newest version of files that change. 



The client application streaming software can be notified of application 
5 upgrades by the streaming application server. These upgrades can be 
marked as mandatory, in which case the client software will force the 
application to be upgraded. 



The client will contact the application streaming server when it starts the 
□ 10 application. At this time, the streaming application server can inform the 
S client of any upgrades. If the upgrade is mandatory, the client will be 

informed, and it will automatically begin using the upgraded application by 
using the new root directory. 



m 



1 5 Multicast Technique 



^ A broadcast or multicast medium may. be used to efficiently distribute 

applications from one application streaming server to multiple application 
streaming clients. Traditional networked application delivery mechanisms 

20 usually involve Installing application code and data on a central server and 
having client machines run the application from that server. The multicast 
mechanism allows a single server to broadcast or multicast the contents of 
an application to many machines simultaneously. The client machines will 
receive the application via the broadcast and save it in their local disk cache. 

25 The entire application can be distributed to a large number of client 
machines from a single server very efficiently. 



The multicast network Is any communication mechanism that has broadcast 
or multicast capability. Such media include television and radio broadcasts 
30 and IP multicasting on the Internet. Each client that is interested in a 
particular application may listen to the multicast media for code and data for 
that application. The code and data are stored in the cache for later use 
when the application is run. 
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These client techniques can be used to distribute data that changes rarely. 
Application delivery is the most appealing use for these techniques, but they 
could easily be adopted to distribute other types of slowly changing code 
and data, such as static databases. 

LOAD BALANCING AND FAULT TOLERANCE FOR STREAMED 

APPLICATIONS 

This section focuses on load balancing (and thereby scalability) and 
hardware fail over. Throughout this discussion reference should be made to 
Fig. 17. Load balancing and fault tolerance are addressed in the invention 
by using a smart client and smart server combination. A preferred 
embodiment of the invention that implements these features includes three 
types of servers (described below): app servers; SLM servers; and an ASP 
Web server. These are organized as follows: 

1: ASP Web server 1703 - This is the Web server that the user goes 
to for subscribing to applications, creating accounts etc. 
Compared to the other two types of servers it is characterized 
by: lowest traffic, fewest number of them, & least likely to go 
down. 

2: SLM Servers 1707 - subscription license manager sen/ers - These 
keep track of which user has subscribed to what applications 
under what license etc. Compared to the other two types of 
sen/ers it is characterized by: medium traffic, manageable 
number, and less likely to go down. 

3: App Servers 1710 - These are the servers to which the users go to 
for application pages. Compared to the other two types of 
servers it is characterized by: highest traffic, most number of 
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them, most likely to go down either due to hardware failure 
or application re-configu ration. 

Server Lists 

Clients 1704 subscribe and unsubscribe to applications via the ASP Web 
server 1703. At that point, instead of getting a primary and a secondary 
server that can perform the job, the ASP Web server 1703 gives them a non- 
prioritized list of a large number of SLM servers 1706 that can do the job. 
When the application starts to run, each client contacts the SLM servers 
1707, 1708, 1709 and receive its application server list 1705 that can serve 
the application in question and also receive the access tokens that can be 
used to validate themselves with the application servers 1710-1715. All 
access tokens have an expiration time after which they need to be renewed. 

Server Selection 

Having gotten a server list for each type of server 1705, 1 706, the client 1 704 
will decide which specific server to send its request to. In a basic 
implementation, a server is picked randomly from the list, which will 
distribute the client* s load on the servers very close to evenly. An alternative 
preferred implementation will do as follows: 

a) Clients will initially pick servers from the list randomly, but they 
will also keep track of the overall response time they get from 
each request; and 

b) As each client learns about response times for each server, it 
can be more intelligent (rather than random) and pick the most 
responsive server. It is believed that the client is better suited 
at deciding which server is most responsive because it can 
keep track of the round trip response time. 
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Client-side Hardware Fail Over 
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The server selection logic provides hardware failover in the following 
manner: 

a) If a server does not respond, /.e., tinnes out, the client 1704 will pick 
another server from its list 1705, 1706 and re-send the request. Since all 
the servers in the client's server list 1705, 1706 are capable of 
processing the client's request, there are no questions of incompatibility. 



b) If a SAS client 1704 gets a second time out, Le., two servers are down, it 

ijj re-sends the request to multiple servers from its list 1705, 1706 in 

^ parallel. This approach staggers the timeouts and reduces the overall 

yS delay in processing a request. 



c) In case of a massive hardware failure, all servers in the client's list 1705, 
1706 may be down. At this point, the client will use the interfaces to 
refresh its server list. This Is where the three tiers of servers become 
important: 



1) If the client 1704 want to refresh its App server list 1705, it 
will contact an SLM server 1707, 1709 in its list of SLM 
servers 1706. Again, the same random (SLM) server 
selection order is utilized here. Most of the time, this request 

25 wilt be successful and the client will get an updated list of 

app servers. 

2) If for some reason all of the SLM servers 1707, 1709 in the 
client's list 1706 are down, it will contact the ASP Web 

30 server 1703 to refresh Its SLM server list 1706. 
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This 3-tiered approach significantly reduces the impact of a single point of 
failure - the ASP Web server 1703, effectively making it a fail over of a fail 
over. 



5 Server Load Balancing 
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In a preferred embodiment of the invention, a server side monitor 1702 
keeps track of the overall health and response times for each server request. 
The Monitor performs this task for all Application and SLM servers. It posts 
10 prioritized lists of SLM servers and app servers 1701 that can serve each of 
the apps in a database shared by the monitor 1702 and all servers. The 
monitor's algorithm for prioritizing server lists is dominated by the server's 
response time for each client request. If any servers fail, the monitor 1702 
informs the ASP 1703 and removes it from the server list 1701. Note that the 
15 server lists 1705, 1706 that the client 1704 maintains are subsets of lists the 
h monitor 1702 maintains in a shared database 1701. 

M Since all servers can access the shared database 1701, they know how to 

'cuf a list of servers to a client. For example, the client starts to run an SAS 
20 application or it wants to refresh its app server list: It will contact an SLM 
server and the SLM server will access the database 1701 and cut a list of 
servers that are most responsive (from the server's prospective). 



In this scheme, the server monitor 1702 is keeping track of what it can track 
25 the best: how effectively servers are processing client requests (server's 
response time). It does not track the network propagation delays etc. that can 
significantly contribute to a client's observed response time. 



ASP Managing Hardware Failovers 

30 

The foregoing approaches provide an opportunity for ASPs to better manage 
massive scale failures. Specifically, when an ASP 1703 realizes that 
massive numbers of servers are down, it can allocate additional resource on 
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a temporary basis. The ASP 1703 can update the central database 1701 
such that clients will receive only the list that the ASP 1703 knows to be up 
and running. This includes any temporary resources added to aid the 
situation. A particular advantage of this approach Is that ASP 1 703 doesn't 
5 need special actions, e.g., emails or phone support, to route clients over to 
these temporary resources; the transition happens automatically. 

Handling Client Crashes and Client Evictions 

5! 10 To prevent the same user from running the same application from multiple 
Q machines, the SLM servers 1707, 1708, 1709 track what access tokens have 

been handed to what users. The SAS file system tracks the beginning and 
end of applications. The user's SAS client software asks for an access token 
from the SLM servers 1707, 1708, 1709 at the beginning of an application if 
15 it already does not have one and it releases the access token when the 
application ends. The SLM server makes sure that at a given point only one 
access token has been given to a particular user. In this manner, the user 
can run the application from multiple machines, but only from one at a 
particular time. However, if the user's machine crashes before the access 
20 token has been relinquished or if for some reason the ASP 1703 wants to 
evict a user, the access token granted to the user must be made invalid. To 
perform this, the SLM server gets the list of application servers 1705 that 
have been sent to the client 1704 for serving the application and sends a 
message to those application servers 1710, 1711, 1713, 1714 to stop 
25 serving that particular access token. This list is always maintained in the 
database so that every SLM server can find out what list is held by the user's 
machine. The application servers before servicing any access token must 
check with this list to ensure that the access token has not become invalid. 
Once the access token expires, it can be removed from this list. 
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SERVER-SIDE PERFORMANCE OPTIMIZATION 
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This section describes approaches that can be taken to reduce client-side 
latency (the time between when an application page is needed and when it 
is obtained) and improve Application Server scalability (a measure of the 
number of servers required to support a given population of clients). The 
5 former directly affects the perceived performance of an application by an end 
user (for application features that are not present in the user's cache), while 
the latter directly affects the cost of providing application streaming services 
to a large number of users. 

10 Application Server Operation 



The basic purpose of the Application Server is to return Application File 
Pages over the network as requested by a client. The Application Server 
y3 holds a group of Stream Application Sets from which it obtains the 

U 15 Application File Pages that match a client request. The Application Server is 
analogous to a typical network file system (which also returns file data), 
except it is optimized for delivery of Application file data, Ae., code or data 
C that belong directly to the application, produced by the software provider, as 

opposed to general user file data (document files and other content 
20 produced by the users themselves). The primary differences between the 
Application Server and a typical network file system are: 



1. The restriction to handle only Application file data allows the 
Application Server to only service read requests, with writes being 
25 disallowed or handled on the client itself in a copy-on-write 

manner; 



2. Access checks occur at the application level, that is a client is 
given all-or-none access to files for a given software application; 

30 

3. The Application Server is designed to operate across the Internet, 
as opposed to typical network file systems, which are optimized to 
run over LANs. This brings up additional requirements of handling 
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server failures, maximizing network bandwidth and minimizing 
latency, and handling security; and 



4. The Application Server is application-aware, unlike typical network 
5 file systems, which treat all software application files the same as 

all other files. This allows the Application Server to use and collect 
per-applicatlon access profile information along with other 
statistics. 

Ms 

0 10 To service a client request, the Application Server software component 
keeps master copies of the full Application Stream Sets on locally accessible 
persistent storage. In main memory, the Application Server maintains a 
cache of commonly accessed Application File Pages. The primary steps 
taken by the Application Server to service a client request are: 
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1 . Receive and decode the client request; 



2. Validate the client's privilege to access the requested data, e.g., by 
means of a Kerberos-style ticket issued by a trusted security 
20 service; 



3. Look up the requested data in the main memory cache, and failing 
that, obtain it from the master copy on disk while placing it in the 
cache; and 

25 

4. Return the File Pages to the client over the network. 



The techniques used to reduce latency and improve server scalability (the 
main performance considerations) are described below. 

30 

Server Optimization Features 
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Read-Only File System for Application Files - Because virtually all 
application files (code and data) are never written to by users, virtually the 
entire population of users have identical copies of the application files. Thus 
a system intending to deliver the application files can distribute a single, 
5 fixed image across all servers. The read-only file system presented by the 
Application Server represents this sharing, and eliminates the complexities 
of replication management, e.g., coherency, that occur with traditional 
network file systems. This simplification enables the Application Sen/ers to 
respond to requests more quickly, enables potential caching at intervening 
Q 10 nodes or sharing of caches across clients in a peer-to-peer fashion, and 
II facilitates fail over, since with the read-only file system the Application File 

Pages as identified by the client (by a set of unique numbers) will always 

m globally refer to the same content in all cases. 

urn 



15 Per-page Compression - Overall latency observed by the client can be 
O reduced under low-bandwidth conditions by compressing each Application 

g File Page before sending it Referring to Fig. 18, the benefits of the use of 

compression in the streaming of Application File Pages, is illustrated. The 
client 1801 and server 1802 timelines are shown for a typical transfer of data 
20 versus the same data sent in a compressed form. The client 1 801 requests 
the data from the server 1803. The server 1803 processes the request 1804 
and begins sending the requested data. The timelines then diverge due to 
the ability to stream the compressed data 1805 faster than the 
uncompressed data 1806. 

25 

With respect to Fig. 19, the invention's pre-compression of Application File 
Pages process is shown. The Builder generates the stream application sets 
1901, 1902 which are then pre-compressed by the Stream Application Set 
Post-Processor 1903. The Stream Application Set Post-Processor 1903 
30 stores the compressed application sets in the persistent storage device 
1904. Any client requests for data are serviced by the Application Server 
which sends the pre-compressed data to the requesting client 1905. The 
reduction in size of the data transmitted over the network reduces the time to 
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arrival (though at the cost of some processing time on the client to 
decompress the data). When the bandwidth is low relative to processing 
power, e.g., 256 kbps with a Pentium-lll-600, this can reduce latency 
significantly. 

5 

Page-set Compression - When pages are relatively small, matching the 
typical virtual memory page size of 4kB, adaptive compression algorithms 
cannot deliver the same compression ratios that they can for larger blocks of 
data, e.g., 32 kB or larger. Referring to Fig. 20, when a client 2001 requests 
O 10 multiple Application File Pages at one time 2002, the Application Server 
Q 2006 can concatenate all the requested pages and compress the entire set 

at once 2004, thereby further reducing the latency the client will experience 
due to the improved compression ratio. If the pages have already been 
compressed 2003, then the request is fulfilled from the cache 2007 where 
1 5 the compressed pages are stored. The server 2006 responds to the client's 
request through the transfer of the compressed pages 2005. 



Post-processing of Stream AppHcation Sets - The Application Server may 
want to perform some post processing of the raw Stream Application Sets in 

20 order to reduce its runtime-processing load, thereby improving its 
performance. One example is to pre-compress all Application File Pages 
contained in the Stream Application Sets, saving a great deal of otherwise 
repetitive processing time. Another possibility is to rearrange the format to 
suit the hardware and operating system features, or to reorder the pages to 

25 take advantage of access locality. 



Static and Dynamic Profiling - With respect to Fig. 21, since the same 
application code is executed in conjunction with a particular Stream 
Application Set 2103 each time, there will be a high degree of temporal 
30 locality of referenced Application File Pages, e.g., when a certain feature is 
invoked, most if not all the same code and data is referenced each time to 
perform the operation. These access patterns can be collected into profiles 
2108, which can be shipped to the client 2106 to guide its prefetching (or to 
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guide server-based 2105 prefetching), and they can be used to pre-package 
groups of Application File Pages 2103, 2104 together and compress them 
offline as part of a post-processing step 2101, 2102, 2103. The benefit of the 
latter is that a high compression ratio can be obtained to reduce client 
5 latency without the cost of runtime server processing load (though only 
limited groups of Application File Pages will be available, so requests which 
don't match the profile would get a superset of their request in terms of the 
pre-compressed groups of Application File Pages that are available). 

: 10 Fast Server-Side Client Privilege Checks - Referring to Fig. 22, having to 
track individual user's credentials, i.e., which Applications they have 
' privileges to access, can limit server scalability since ultimately the per-user 

I data must be backed by a database, which can add latency to servicing of 

user requests and can become a central bottleneck. Instead, a separate 



15 License Server 2205 is used to offload per-user operations to grant 

u 

privileges to access application data, and thereby allow the two types of 
I servers 2205, 2210 to scale independently. The License Server 2205 

provides the client an Access Token (similar to a Kerberos ticket) that 
contains information about what application it represents rights for along with 
20 an expiration time. This simplifies the operations required by the Application 
Server 2210 to validate a client's privileges 2212. The Application Server 
221 0 needs only to decrypt the Access Token (or a digest of it) via a secret 
key shared 2209 with the License Server 2205 (thus verifying the Token is 
valid), then checking the validity of its contents, e.g., application identifier, 
25 and testing the expiration time. Clients 2212 presenting Tokens for which all 
checks pass are granted access. The Application Server 2210 needs not 
track anything about individual users or their identities, thus not requiring 
any database operations. To reduce the cost of privilege checks further, the 
Application Server 2210 can keep a list of recently used Access Tokens for 
30 which the checks passed, and if a client passes in a matching Access Token, 
the server need only check the expiration time, with no further decryption 
processing required. 
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Connection Management - Before data is ever transferred from a client to a 
server, the network connection itself takes up one and a half network round 
trips. This latency can adversely impact client performance if it occurs for 
every client request. To avoid this, clients can use a protocol such as HTTP 
5 1.1, which uses persistent connections, /.e., connections stay open for 
multiple requests, reducing the effective connection overhead. Since the 
client-side file system has no knowledge of the request patterns, it will simply 
keep the connection open as long as possible. However, because traffic 
from clients may be bursty, the Application Server may have more open 
10 connections than the operating system can support, many of them being 
temporarily idle. To manage this, the Application Server can aggressively 
close connections that have been idle for a period of time, thereby achieving 
^ a compromise between the client's latency needs and the Application 

r Server's resource constraints. Traditional network file systems do not 

15 manage connections in this manner, as LAN latencies are not high enough 
C3 to be of concern. 
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Application Server Memory Usage/Load Balancing - File servers are heavily 
dependent on main memory for fast access to file data (orders of magnitude 

20 faster than disk accesses). Traditional file servers manage their main 
memory as a cache of file blocks, keeping the most commonly accessed 
ones. With the Application Server, the problem of managing main memory 
efficiently becomes more complicated due to there being multiple servers 
providing a shared set of applications. In this case, if each server managed 

25 its memory independently, and was symmetric with the others, then each 
server would only keep those file blocks most common to all clients, across 
all applications. This would cause the most common file blocks to be in the 
main memory of each and every Application server, and since each server 
would have roughly the same contents in memory, adding more servers 

30 won't improve scalability by much, since not much more data will be present 
in memory for fast access. For example, if there are application A (accessed 
50% of the time), application B (accessed 40% of the time), and application 
C (accessed 10% of the time), and application A and B together consume 
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more memory cache than a single Application Server has, and there are ten 
Application Servers, then none of the Application Servers will have many 
blocks from C in memory, penalizing that application, and doubling the 
number of servers will improve C's performance only minimally. This can be 
5 improved upon by making the Application Servers asymmetric, in that a 
central mechanism, e.g., system administrator, assigns individual Application 
Servers different Application Stream Sets to provide, in accordance with 
popularity of the various applications. Thus, in the above example, of the ten 
servers, five can be dedicated to provide A, four to B, and one to C, (any 
1 0 extra memory available for any application) making a much more effective 
use of the entire memory of the system to satisfy the actual needs of clients. 
-1 This can be taken a step further by dynamically (and automatically) changing 

-\j the assignments of the servers to match client accesses over time, as groups 

of users come and go during different time periods and as applications are 
1 5 added and removed from the system. This can be accomplished by having 
servers summarize their access patterns, send them to a central control 
1.^' server, which then can reassign servers as appropriate. 



CONVERSION OF CONVENTIONAL APPLICATIONS TO ENABLE 
20 STREAMED DELIVERY AND EXECUTION 

The Streamed Application Set Builder is a software program. It is used to 
convert locally installable applications into a data set suitable for streaming 
over a network. The streaming-enabled data set is called the Streamed 
25 Application Set (SAS). This section describes the procedure used to convert 
locally installable applications into the SAS. 

The application conversion procedure Into the SAS consists of several 
phases. In the first phase, the Builder program monitors the installation 
30 process of a local installation of the desired application for conversion. The 
Builder monitors any changes to the system and records those changes In 
an intermediate data structure. After the application is installed locally, the 
Builder enters the second phase of the conversion. In the second phase, the 
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Builder program invokes the installed application executable and obtains 
sequences of frequently accessed file blocks of this application. Both the 
Builder program and the client software use the sequence data to optimize 
the performance of the streaming process. Once the sequencing information 
5 is obtained, the Builder enters the final phase of the conversion. In this 
phase, the Builder gathers all data obtained from the first two phases and 
processes the data into the Streamed Application Set. 

Detailed descriptions of the three phases of the Builder conversion process 
10 are described in the following sections. The three phases consist of 
installation monitoring (IM), application profiling (AP), and SAS packaging 
(SP). In most cases, the conversion process is general and applicable to all 
types of systems. In places where the conversion is OS dependent, the 
discussion is focused on the Microsoft Windows environment. Issues on 
15 conversion procedure for other OS environments are described in later 
sections. 

Installation Monitoring (IM) 

20 In the first phase of the conversion process, the Builder Installation Monitor 
(IM) component invokes the application installation program that installs the 
application locally. The IM observes all changes to the local computer 
during the installation. The changes may involve one or more of the 
following: changes to system or environment variables; and modifications, 

25 addition, or deletion of one or more files. Initial system variables, 
environment variables, and files are accounted for by the IM before the 
installation begins to give a more accurate picture of any changes that are 
observed. The IM records all changes to the variables and files In a data 
structure to be sent to the Builder's Streamed Application Packaging 

30 component. In the following paragraphs, detailed description of the 
Installation Monitor is described for Microsoft Windows environment. 
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In Microsoft Windows system, the Installation Monitor (IM) component 
consists of a kernel-mode driver subcomponent and a user-mode 
subcomponent. The kernel-mode driver is hooked into the system registry 
and file system function interface calls. The hook into the registry function 
calls allows the IM to monitor system variable changes. The hook Into the 
file system function calls enables the IM to observe file changes. 

Installation Monitor Kernel-Mode subcomponent (IM-KM) 

With respect to Fig. 23, the IM-KM subcomponent monitors two classes of 
information during an application installation: system registry modifications 
and file modifications. Different techniques are used for each of these 
classes. 

To monitor system registry modifications 2314, the IM-KM component 
replaces all kernel-mode API calls in the System Service Table that write to 
the system registry with new functions defined in the IM-KM subcomponent. 
When an installation program calls one of the API functions to write to the 
registry 2315, the IM-KM function is called instead, which logs the 
modification data 2317 (including registry key path, value name and value 
data) and then forwards the call to the actual operating system defined 
function 2318. The modification data is made available to the IM-UM 
subcomponent through a mechanism described below. 

To monitor file modifications, a filter driver is attached to the file system's 
driver stack. Each time an installation program modifies a file on the system, 
a function is called in the IM-KM subcomponent, which logs the modification 
data (including file path and name) and makes it available to the IM-UM 
using a mechanism described below. 

The mechanisms used for monitoring registry modifications and file 
modifications will capture modifications made by any of the processes 
currently active on the computer system. While the installation program is 
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running, other processes that, for example, operate the desktop and service 
network connections may be running and may also modify files or registry 
data during the Installation. This data must be removed from the modification 
data to avoid inclusion of modifications that are not part of the application 
5 installation. The IM-KM uses process monitoring to perform this filtering. 

To do process monitoring, the IM-KM installs a process notification callback 
function that is called each time a process is created or destroyed by the 
operating system. Using this callback function, the operating system sends 
the created process ID as well as the process ID of the creator (or parent) 
process. The IM-KM uses this information, along with the process ID of the 
IM-UM, to create a list of all of the processes created during the application 
installation. The IM-KM uses the following algorithm to create this list: 

1 . Before the installation program is launched by the IM-UM, the 
IM-UM passes its own process ID to the IM-KM. Since the IM- 
UM is launching the installation application, the IM-UM will be 
the ancestor (parent, grandparent, etc.) of any process (with 
one exception - the Installer Service described below) that 
modifies files or registry data as part of the application 
installation. 

2. When the installation is launched and begins the creating 
processes, the IM-KM process monitoring logic is notified by 

25 the operating system via the process notification callback 

function. 

3. If the creator (parent) process ID sent to the process notification 
callback function is already in the process list, the new process 

30 is Included in the list. 

When an application on the system modifies either the registry or files, and 
the IM-KM monitoring logic captures the modification data, but before making 
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it available to the IM-UM, it first ciiecks to see if the process that modified the 
registry or file is part of the process list. It is only made available to the IM-UM 
if It is in the process list. 



5 It is possible that a process that is not a process ancestor of the IM-UM will 
make changes to the system as a proxy for the installation application. Using 
Interprocess communication, an installation program may request than an 
Installer Service make changes to the machine. In order for the IM-KM to 

capture changes made by the Installer Service, the process monitoring logic 
10 includes a simple rule that also includes any registry or file changes that 
5 have been made by a process with the same name as the Installer Service 

process. On Windows 2000, for example, the Installer Service is called 
"msi.exe". 



15 Installation Monitor User-Mode subcomponent (IM-UM) 



The IM kernel-mode (IM-KM) driver subcomponent is controlled by the user- 
Li mode subcomponent (IM-UM). The IM-UM sends messages to the IM-KM to 
start 2305 and stop 2309 the monitoring process via standard I/O control 
20 messages known as lOCTLs. The message that starts the IM-KM also 
passes in the process ID of the IM-UM to facilitate process monitoring 
described in the IM-KM description. 



When the installation program 2306 modifies the computer system, the IM- 
25 KM signals a named kernel event. The IM-UM listens for these events during 
the installation. When one of these events is signaled, the IM-UM calls the 
IM-KM using an lOCTL message. In response, the IM-KM packages data 
describing the modification and sends it to the IM-UM 231 8. 



30 The IM-UM sorts this data and removes duplicates. Also, it parameterizes all 
local-system-specific registry keys, value names, and values. For example, 
an application will often store paths in the registry that allow it to find certain 
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files at run-time. These path specifications must be replaced with parameters 
that can be recognized by the client installation software. 



A user interface is provided for the IM-UM that allows an operator of the 
5 Builder to browse through the changes made to the machine and to edit the 
modification data before the data is packaged into an SAS. 



20 



Once the installation of an application is completed 2308, the IM-UM 
forwards data structures representing the file and registry modifications to 



Q 10 the Streamed Application Packager 2312. 



Monitoring Application Configuration 



^ Using the techniques described above for monitoring file modifications and 

M 15 monitoring registry modifications, the builder can also monitor a running 

Q application that is being configured for a particular working environment. The 

B 

M on multiple machines, making it unnecessary for each user to configure 



data acquired by the IM-UM can be used to duplicate the same configuration 
on multiple machines, making it v 
his/her own application Installation. 



An example of this is a client server application for which the client will be 
streamed to the client computer system. Common configuration 
modifications can be captured by the IM and packed into the SAS. When the 
application is streamed to the client machine, it is already configured to 
25 attach to the server and begin operation. 



Application Profiling (AP) 



Referring to Fig. 24, In the second phase of the conversion process, the 
30 Builder's Application Profiler (AP) component invokes the application 
executable program that is installed during the first phase of the conversion 
process. Given a particular user input, the executable program file blocks 
are accessed in a particular sequence. The purpose of the AP is to capture 
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the sequence data associated with some user inputs. This data is useful in 
several ways. 

First of all, frequently used file blocks can be streamed to the client machine 
before other less used file blocks. A frequently used file block is cached 
locally on the client cache before the user starts using the streamed 
application for the first time. This has the effect of making the streamed 
application as responsive to the user as the locally installed application by 
hiding any long network latency and bandwidth problems. 



Q Secondly, the frequently accessed files can be reordered in the directory to 

iri 

allow faster lookup of the file information. This optimization is useful for 



directories with large number of files. When the client machine looks up a 
frequently used file in a directory, it finds this file early in the directory search. 
T]. 15 In an application run with many directory queries, the performance gain is 



significant. 

Finally, the association of a set of file blocks with a particular user input 
allows the client machine to request minimum amount of data needed to 
20 respond to that particular user command. The profile data association with a 
user command is sent from the server to the client machine in the 
ApplnstallBlock during the 'preparation' of the client machine for streaming. 
When the user on a client machine invokes a particular command, the codes 
corresponding to this command are prefetched from the server. 

25 

The Application Profiler (AP) is not as tied to the system as the Installation 
Monitor (IM) but there are still some OS dependent issues. In the Windows 
system, the AP still has two subcomponents: kernel-mode (AP-KM) 
subcomponent and the user-mode (AP-UM) subcomponent The AP-UM 
30 invokes the converting application executable. Then AP-UM starts the AP- 
KM 2403, 2413 to track the sequences of file block accesses by the 
application 2414. Finally when the application exits after the pre-specified 
amount of sequence data Is gathered, the AP-UM retrieves the data from AP- 
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KM 2406, 2417 and forwards the data to the Streamed Application Packager 
2411, 



Streamed Application Set Packaging (SP) 

5 

With respect to Fig. 25, in the final phase of the conversion process, the 
Builder's Streamed Application Set Packager (SP) component processes 
the data structure from IM and AP to create a data set suitable for streaming 
over the network. This converted data set is called the Streamed Application 
53 10 Set 2520 and is suitable for uploading to the Streamed Application Servers 
^ for subsequent downloading by the stream client. Figure 23 shows the 

tfl control flow of the SP module. 



Each file included in a Streamed Application Set 2520 is assigned a file 
f^' 1 5 number that identifies it within the SAS. 

The Streamed Application Set 2520 consists of the three sets of data from 
the Streamed Application Server's perspective. The three types of data are 
the Concatenation Application File (CAF) 2519, 2515, the Size Offset File 
20 Table (SOFT)2518, 2514, 2507, and the Root Versioning Table (RVT) 2518, 
2514. 



The CAF 2519, 2515 consists of all the files and directories needed to 
stream to the client. The CAF can be further divided into two subsets: 
25 initialization data set and the runtime data set 



The initialization data set is the first set of data to be streamed from the 
server to the client. This data set contains the information captured by IM 
and AP needed by the client to prepare the client machine for streaming this 
30 particular application. This initialization data set is also called the 
ApplnstallBlock (AIB) 2516, 2512. In addition to the data captured by the IM 
and AP modules, the SP is also responsible for merging any new dynamic 
profile data gathered from the client and the server. This data is merged into 
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the existing ApplnstallBlock to optimize subsequent streaming of the 
application 2506. With the list of files obtained by the IM during application 
installation, the SP module separates the list of files into regular streamed 
files and the spoof files. The spoof files consists of those files not installed 
5 into standard application directory. This includes files installed into system 
directories and user specific directories. The detailed format description of 
the ApplnstallBlock is described later. 

The second part of the CAF consists of the runtime data set. This is the rest 
Q 10 of the data that is streamed to the client once the client machine is initialized 



^1 



for this particular application. The runtime data consists of all the regular 
application files and the directories containing information about those 
application files. Detailed format description of the runtime data in the CAF 
section is described below. The SP appends every file recorded by IM into 



15 the CAF and generates all directories. Each directory contains list of file 
Q name, file number, and the metadata associated with the files in that 

particular directory 251 7, 251 3. 



The SP is also responsible for generating the SOFT file 2518, 2514, 2507. 
20 This is a table used to index into the CAF for determining the start and the 
end of a file. The server uses this information to quickly access the proper 
file within the directory for serving the proper file blocks to the client. 

Finally, the SP creates the RVT file 2518, 2514. The Root Versioning Table 
25 contains a list of root file number and version number. This information is 
used to track minor application patches and upgrades. Each entry in the 
RVT corresponds to one patch level of the application with a corresponding 
new root directory. The SP generates new parent directories when any 
single file in that subdirectory tree is changed from the patched upgrade. 
30 The RVT is uploaded to the server and requested by the client at appropriate 
time for the most updated version of the application by a simple comparison 
of the client's Streamed Application root file number with the RVT table 
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located on the server once the client Is granted access authorization to 
retrieve the data. 



10 



With respect to Figs. 26a and 26b, the internal representation of a simple 
SAS before and after a new file is added to a new version of an application 
Is shown. The original CAF 2601 has the new files 2607 appended to it 
2604 by the SP. The SOFT 2602 is correspondingly updated 2605 with the 
appropriate table entries 2608 to index the new flies 2607 the CAF 2604. 
Finally, the RVT 2603 is updated 2606 to reflect the new version 2609. 

Data Flow Description 



The following list describes the data that is passed from one component to 
another. The numbers corresponds to the numbering in the Data Flow 
15 diagram of Fig. 27. 



Install Monitor 

1 , The full pathname of the installer program is queried from the 
user by the Builder program and is sent to the Install Monitor 
20 2. The Install Monitor (IM) user-mode sends a read request to the 

OS to spawn a new process for installing the application on the 
local machine. 

3. The OS loads the application installer program into memory 
and runs the application installer program. OS returns the 

25 process ID to the IM, 

4. The application program Is started by the IM-UM. 

5. The application installer program sends read request to the OS 
to read the content of the CD. 

6. The CD media data files are read from the CD. 

30 7. The files are written to the appropriate location on the local 

hard-drive. 
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8. IM kernel-mode captures all file read/write requests and all 
registry read/write requests by the application installer 
program. 

9. IM user-mode program starts the IM kernel-mode program and 
5 sends the request to start capturing all relevant file and registry 

data. 

10. IM kernel-mode program sends the list of all file modifications, 
additions, and deletions; and all registry modifications, 
additions, and deletions to the IM user-mode program. 

10 11. IM informs the SAS Builder Ul that the installation monitoring 

has completed and displays the file and registry data in a 
graphical user interface. 

Application Profiler 

H 15 12. Builder Ul invokes Application Profiling (AP) user-mode 

□ program by querying the user for the list of application 

JiL; executable names to be profiled. The AP user-mode also 

queries the user for division of file blocks into sections 
corresponding to the commands Invoked by the user of the 
20 application being profiled. 

13. Application Profiler user-mode invokes each application 
executable in succession by spawning each program in a new 
process. The OS loads the application executable into 
memory, runs the application executable, and returns the 

25 process ID to the Application Profiler. 

14. During execution, the OS on behalf of the application, sends 
the request to the hard-drive controller to read the appropriate 
file blocks into memory as needed by the application, 

15. The hard-drive controller returns all file blocks requested by the 
30 OS. 

16. Every file access to load the application file blocks into memory 
is monitored by the Application Profiler (AP) kernel-mode 
program. 
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17. The AP user-mode program informs the AP kernel-mode 
program to start monitoring relevant file accesses. 

18. Application Profiler kernel-mode returns the file access 
sequence and frequency information to the user-mode 

5 program. 

19. Application Profiler returns the processed profile information. 
This has two sections. The first section is used to identify the 
frequency of files accessed. The second section is used to list 
the file blocks for prefetch to the client. The file blocks can be 

10 further categorized into subsections according to the 

commands invoked by the user of the application. 



SAS Packager 

20. The Streamed Application Packager receives files and registry 
1 5 changes from the Builder Ul. It also receives the file access 

frequency and a list of file blocks from the Profiler. File numbers 
are assigned to each file. 

21 . The Streamed Application Packager reads all the file data from 
the hard-drive that are copied there by the application installer. 

20 22. The Streamed Application Packager also reads the previous 

version of Streamed Application Set for support of minor patch 
upgrades. 

23. Finally, the new Streamed Application Set data is stored back 
to non-volatile storage. 
25 24. For new profile data gathered after the SAS has been created, 

the packager is invoked to update the AppinstallBlock in the 
SAS with the new profile information. 



Mapping of Data Flow to Streamed Application Set (SAS) 
30 Step 7: Data gathered from this step consist of the registry and file 

modification, addition, and deletion. The data are mapped to 
the ApplnstallBlock's File Section, Add Registry Section, and 
Remove Registry Section. 
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step 8 & 19: File data are copied to tlie local hard-drive then 
concatenated into part of the CAF contents. Part of the data is 
identified as spoof or copied files and the file names and/or 
contents are added to the ApplnstallBlock. 
5 Step 15 & 21: Part of the data gathered by the Profiler or gathered 

dynamically by the client is used in the ApplnstallBlock as a 
prefetch hint to the client. Another part of the data is used to 
generate a more efficient SAS Directory content by ordering 
the files according the usage frequency. 
10 Step 20: If the installation program was an upgrade, SAS Packager 

needs previous version of the Streamed Application Set data. 
rS Appropriate data from the previous version are combined with 

^ the new data to form the new Streamed Application Set. 

t y 

.in 

14, 1 5 Format of Streamed Application Set 



Referring to Fig. 28, the format of the Streamed Application Set consists of 
2 three sections: Root Version Table (RVT) 2802, Size Offeet File Table 

(SOFT) 2803, and Concatenation Application File (CAF) 2801. The RVT 
20 section 2802 lists all versions of the root file numbers available in a 
Streamed Application Set. The SOFT 2803 section consists of the pointers 
into the CAF 2801 section for every file in the CAF 2801 . The CAF section 
2801 contains the concatenation of all the files associated with the streamed 
application. The CAF section 2801 is made up of regular application files, 
25 SAS directory files 2805, ApplnstallBlock 2804, and icon files. See below 
for detailed information on the content of the SAS file. 

OS Dependent Format 

30 The format of the Streamed Application Set is designed to be as portable as 
possible across all OS platforms. At the highest level, the format of CAF, 
SOFT, and RVT that make up the format of the Streamed Application Set are 
completely portable across any OS platforms. One piece of data structure 
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that is OS dependent is located 4n the initialization data set called 
ApplnstallBlock in the CAF. This data is dependent on the type of OS due to 
the differences in low-level system differences among different OS. For 
example, Microsoft Windows contains system environment variables called 
5 the Registry. The Registry has a particular tree format not found in other 
operating systems like UNIX or MacOS. 

Another OS dependent piece of data is located in the SAS directory files in 
the CAF. The directory contains file metadata Information specific to 
M 10 Windows files. For example on the UNIX platform, there does not exist a 
ii hidden flag. This platform specific information needs to be transmitted to the 

7i client to fool the streamed application into believing that the application data 

N is located natively on the client machine with all the associated file metadata 

intact. If SAS is to be used to support streaming of UNIX or MacOS 
1 5 applications, file metadata specific to those systems will need to be recorded 
If in the SAS directory. 



Lastly, the format of the file names itself Is OS dependent. Applications 
running on the Windows environment inherit the old MSDOS 8.3 file name 
20 format To support this properly, the format of the SAS Directory file in CAF 
requires an additional 8.3 field to store this information. This field is not 
needed in other operating systems like UNIX or MacOS. 
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Device Driver Versus File System Paradigm 



Referring to Figs. 29 and 30, the SAS client Prototype is implemented using 
the 'device driver' paradigm. One of the advantages of the device driver 
approach is that the caching of the sector blocks is simpler. In the device 
driver approach, the client cache manager 2902 only needs to track sector 
30 numbers in its cache 2903. In comparison with the 'file system' paradigm, 
more complex data structure are required by the client cache manager 3002 
to track a subset of a file that is cached 3003 on a client machine. This 
makes the device driver paradigm easier to implement. 
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On the other hand, there are many drawbacks to the device driver paradigm. 
On the Windows system, the device driver approach has a problem 
supporting large numbers of applications. This is due to the phantom 
5 limitation on the number of assignable drive letters available in a Windows 
system (26 letters); and the fact that each application needs to be located on 
its own device. Note that having multiple applications on a device is 
possible, but then the server needs to maintain an exponential number of 
devices that support all possible combinations of applications. This is too 
^ 1 0 costly to maintain on the server. 
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Another problem with the device driver approach is that the device driver 
operates at the disk sector level. This is a much lower level than operating at 

s s ? 

iO the file level in the file system approach. The device driver does not know 

^ 1 5 anything about files. Thus, the device driver cannot easily interact with the 
^ file level issues. For example, spoofing files and interacting with the OS file 

cache is nearly impossible with the device driver approach. Both spoofing 
files and interacting with the OS buffer cache are needed to get higher 
performance. In addition, operating at the file system level lends to 
20 optimizing the file system to better suit this approach of running applications. 
For instance, typical file systems do logging and make multiple disk sector 
requests at a time. These are not needed in this approach and are actually 
detrimental to the performance. When operating at the device driver level, 
not much can be done about that. Also, operating at the file system level 
25 helps in optimizing the protocol between the client and the server. 



Implementation in the Prototype 



The prototype has been implemented and tested successfully on the 
30 Windows and Linux distributed system. The prototype is implemented using 
the device driver paradigm as described above. The exact procedure for 
streaming application data is described next. 



67 



First of all, the prototype server is started on either the Windows-based or 
Linux-based system. The server creates a large local file mimicking large 
local disk images. Once the disk images are prepared, it listens to TCP/IP 
ports for any disk sector read or write requests. 

Implementation of SAS Builder 

The SAS Builder has been implemented on the Windows-based platfonm. A 
preliminary Streamed Application Set file has been created for real-world 
applications like Adobe Photoshop. A simple extractor program has been 
developed to extract the SAS data on a pristine machine without the 
application installed locally. Once the extractor program is run on the SAS, 
the application runs as if it was installed locally on that machine. This 
process verifies the correctness of the SAS Building process. 

FORMAT OF STREAMED APPLICATION SET (SAS) 
Functionality 

The streamed application set (SAS), illustrated in Rg. 28, is a data set 
associated with an application suitable for streaming over the network. The 
SAS is generated by the SAS Builder program. The program converts 
locally installable applications into the SAS. This section describes the 
format of the SAS. 

Note: Fields greater than a single byte are stored in little-endian format. The 
Stream Application Set (SAS) file size is limited to 2^ bytes. The files in 
the CAF section are laid out in the same order as its corresponding entries in 
the SOFT table. 

Data Type Definitions 
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The format of the SAS consists of four sections: header, Root Version Table 
(RVT), Size Offset File Table (SOFT), and Concatenation Application File 
(CAF) sections. 



1 ■ Header section 

o MagicNumber [4 bytes]: Magic number identifying the file 

content with the SAS. 
o ESSVersion [4 bytes]: Version number of the SAS file format, 
o AppID [16 bytes]: A unique application ID for this application. 

This field must match the AppID located in the ApplnstallBlock. 

Window Guidgen API is used to create this identifier, 
o Flags [4 bytes]: Flags pertaining to SAS. 
o Reserved [32 bytes]: Reserved spaces for future, 
o RVToffset [8 bytes]: Byte offset into the start of the RVT section, 
o RVTsize [8 bytes]: Byte size of the RVT section, 
o SOFToffset [8 bytes]: Byte offset into the start of the SOFT 

section. 

o SOFTsIze [8 bytes]: Byte size of the SOFT section. 

o CAFoffset [8 bytes]: Byte offset into the start of the CAF section. 

o CAFsize [8 bytes]: Byte size of the CAF section. 

o VendorNamelsAnsi [1 byte]: 0 if the vendor name is in Unicode 

format. 1 if the vendor name is in ANSI format, 
o VendorNameLength [4 bytes]: Byte length of the vendor name, 
o VendorName p< bytes]: Name of the software vendor who 

created this application, e.g., "Microsoff . Null-terminated, 
o AppBaseNamelsAnsi [1 byte]: 0 if the vendor name is in 

Unicode format. 1 if the vendor name is in ANSI format, 
o AppBaseNameLength [4 bytes]: Byte length of the application 

base name. 

o AppBaseName [X bytes]: Base name of the application, e.g., 
"Word 2000". Null-terminated. 
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o MessagelsAnsi [1 byte]: 0 if the vendor name is in Unicode 

format. 1 if the vendor name is in ANSI format, 
o IVlessageLength [4 bytes]: Byte length of the message text, 
o Message [X bytes]: Message text. NulWerminated. 

5 

2. Root Version Table (RVT) section 

The Root version entries are ordered in a decreasing value according 
to their file numbers. The Builder generates unique file numbers 

within each SAS in a monotonically increasing value. So larger root 
10 file numbers imply later versions of the same application. The latest 

root version is located at the top of the section to allow the SAS 
Server easy access to the data associated with the latest root version. 



m 



^ o NumberEntries [4 bytes]: Number of patch versions contained 

u 15 in this SAS. The number indicates the number of entries in the 



Root Version Table (RVT). 

Root Version structure: (variable number of entries) 

o VersionNumber [4 bytes]: Version number of the root directory. 
20 o FileNumber [4 bytes]: File number of the root directory. 

o VersionNamelsAnsi [1 byte]: 0 if the vendor name Is in Unicode 

format. 1 if the vendor name is in ANSI format, 
o VersionNameLength [4 bytes]: Byte length of the version name 
o VersionName [X bytes]: Application version name, e.g., "SP 1". 
25 o Metadata [32 bytes]: See SAS FS Directory for format of the 

metadata. 



3. Size Offset File Table (SOFT) section 

The SOFT table contains information to locate specific files In the CAF 
30 section. The entries are ordered according to the file number starting 

from 0 to NumberFiles-1 . The start of the SOFT table Is aligned to 
eight-byte boundaries for faster access. 
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SOFT entry structure: (variable number of entries) 

o Offset [8 bytes]: Byte offset into CAF of the start of this file, 
o Size [8 bytes]: Byte size of this file. The file is located from 
address Offset to Offset+Size. 

Concatenation Application File (CAF) section 

CAF is a concatenation of all file or directory data Into a single data 
structure. Each piece of data can be a regular file, an 
ApplnstallBlock, an SAS FS directory file, or an icon file. 



S a. Regular Files 

o FileData [X bytes]: Content of a regular file 

.a 

b. ApplnstallBlock (See ApplnstallBlock section for detailed format) 
p 15 A simplified description of the ApplnstallBlock is listed here. The 

exact detail of the individual fields in the ApplnstallBlock are 
described later. 

o Header section [X bytes]: Header for ApplnstallBlock containing 
20 information to identify this ApplnstallBlock. 

o Files section \X bytes]: Section containing file to be copied or 
spoofed. 

o AddVariable section [X bytes]: Section containing system 
variables to be added. 
25 o RemoveVariable section [X bytes]: Section containing system 

variables to be removed, 
o Prefetch section [X bytes]: Section containing pointers to file 

blocks to be prefetched to the client, 
o Profile section [X bytes]: Section containing profile data. 
30 o Comment section [X bytes]: Section containing comments 

about ApplnstallBlock. 
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o Code section [X bytes]: Section containing application-specific 
code needed to prepare local machine for streaming this 
application 

o LicenseAgreement section [X bytes]: Section containing 
licensing agreement message. 

SAS Directory 

An SAS Directory contains information about the subdirectories 
and files located within this directory. This information is used to 
store metadata information related to the files associated with the 
streamed application. This data is used to fool the application into 
thinking that it is running locally on a machine when most of the 
data is resided elsewhere. 

The SAS directory contains information about files in its directory. 
The information includes file number, names, and metadata 
associated with the files. 

o MagicNumber [4 bytes]: Magic number for SAS directory file, 
o ParentFilelD [16+4 bytes]: ApplD+FileNumber of the parent 

directory. AppID is set to 0 if the directory is the root, 
o SelfFilelD [16+4 bytes]: ApplD+FileNumber of this directory, 
o NumFiles [4 bytes]: Number of files in the directory. 

Variable-Sized File Entry: 

o UsedFIag [1 byte]: 1 for used, 0 for unused. 

o ShortLen [1 byte]: Length of short file name. 

o LongLen [2 byte]: Length of long file name. 

o NameHash [4 bytes]: Hash value of the short file name for quick 
lookup without comparing whole string. 

o ShortName [24 bytes]: 8.3 short file name in Unicode. Not null- 
terminated. 



72 



o FilelD [16+4 bytes]: ApplD+FileNumber of each file in this 
directory. 

o Metadata [32 bytes]: The metadata consists of file byte size (8 
bytes), file creation time (8 bytes), file modified time (8 bytes), 
attribute flags (4 bytes), SAS flags (4 bytes). The bits of the 
attribute flags have the following meaning: 

■ Bit 0: Read-only - Set if file is read-only 

■ Bit 1 : Hidden - Set if file is hidden from user 

■ Bit 2: Directory - Set if the file is an SAS Directory 

■ Bit 3: Archive - Set if the file is an archive 

■ Bit 4: Normal - Set if the file is normal 

■ Bit 5: System - Set if the file is a system file 

■ Bit 6: Temporary - Set if the file is temporary 

The bits of the SAS flags have the following meaning: 

■ Bit 0: ForceUpgrade - Used only on root file. Set 
if client is forced to upgrade to this particular 
version if the current root version on the client is 
older. 

■ Bit 1: RequireAccessToken - Set if file require 
access token before client can read it. 

■ Bit 2: Read-only - Set if the file is read-only 

o LongName [X bytes]: Long filename In Unicode format with 
null-termination character. 

Icon files 

o IconFileData [X bytes]: Content of an Icon file. 
FORMAT OF APPINSTALLBLOCK 
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Functionality 



With respect to Figs. 31a-31h, the ApplnstallBlock Is a block of code and 
data associated with a particular application. This ApplnstallBlock contains 
5 the information needed to by the SAS client to initialize the client machine 
before the streamed application is used for the first time. It also contains 
optional profiling data for increasing the runtime performance of that 
streamed application. 

H 10 The ApplnstallBlock is created offline by the SAS Builder program. First of 
CI all, the Builder monitors the installation process of a local version of the 

tl application installation program and records changes to the system. This 

includes any environment variables added or removed from the system 
^ 3103, and any files added or modified in the system directories 3102. Files 

1,^ 15 added to the application specific directory are not recorded in the 
J^fJ ApplnstallBlock to reduce the amount of time needed to send the 

Q ApplnstallBlock to the SAS client. Secondly, the Builder profiles the 

application to obtain the list of critical pages needed to run the application 
initially and an initial page reference sequence of the pages accessed 
20 during a sample run of the application. The ApplnstallBlock contains an 
optional application-specific initialization code 3107. This code is needed 
when the default initialization procedure is insufficient to setup the local 
machine environment for that particular application. 



25 The ApplnstallBlock and the runtime data are packaged into the SAS by the 
Builder and then uploaded to the application server. After the SAS client is 
subscribed to an application and before the application is run for the first 
time, the ApplnstallBlock is sent by the server to the client. The SAS client 
invokes the default initialization procedure and the optional application- 

30 specific initialization code 3107. Together, the default and the application- 
specific initialization procedure process the data in the ApplnstallBlock to 
make the machine ready for streaming that particular application. 
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Data type definitions 

The ApplnstallBlock is divided into tfie following sections: header section 
3101, variable section 3103, file section 3102, profile section 3105, prefetch 
section 3104, comment section 3106, and code section 3107. The header 
section 3101 contains general information about the ApplnstallBlock. The 
information includes the total byte size and an index table containing size 
and offset into other sections. In a Windows version, the variable section 
consists of two registry tree structures to specify the registry entries added or 
removed from the OS environment. The file section 3102 is a tree structure 
consisting of the files copied to C drive during the application installation. 
The profile section 3105 contains the initial set of block reference sequences 
during Builder profiling of the application. The prefetch section 3104 
consists of a subset of profiled blocks used by the Builder as a hint to the 
SAS client to prefetch initially. The comment section 3106 is used to inform 
the SAS client user of any relevant information about the application 
installation. Finally, the code section 3107 contains an optional program 
tailored for any application-specific installation not covered by the default 
streamed application installation procedure. In Windows version, the code 
section contains a Windows DLL. The following is a detailed description of 
each fields of the ApplnstallBlock. 

Note: Little endian format is used for all the fields spanning more than 1 byte. 
Also, BlockNumber specifies blocks of 4K byte size. 

1 . Header Section: 

The header section 3103 contains the basic information about this 
ApplnstallBlock. This includes the versioning information, application 
identification, and index into other sections of the file. 

Core Header Structure: 
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o AibVersion [4 bytes]: Magic number or applnstallBlock version 

number (whicli identifies the version of tlie applnstallBlock 

structure rather than the contents), 
o Appid [16 bytes]: this is an application identifier unique for each 

application. On Windows, this identifier is the GUID generated 

from the 'guidgen' program. AppId for Word on Win98 will be 

different from Word on WinNT if it turns out that Word binaries are 

different between NT and 98. 
o VerslonNo [4 bytes]: Version number. This allows us to inform the 

client that the applnstallBlock has changed for a particular appld. 

This is useful for changes to the ApplnstallBlock due to minor 

patch upgrades in the application, 
o ClientOSBitMap [4 bytes]: Client OS supported bitmap or ID: for 

Win2K, Win98, WinNT (and generally for other and multiple OSs). 
o ClientOSServicePack [4 bytes]: For optional storage of the service 

pack level of the OS for which this applnstallBlock has been 

created. Note that when this field is set, the multiple OS bits in the 

above field ClientOSBitMap are not used, 
o Flags [4 bytes]: Flags pertaining to ApplnstallBlock 

■ Bit 0: Reboot - If set, the SAS client needs to reboot the 
machine after installing the ApplnstallBlock on the client 
machine. 

■ Bit 1 : Unicode - If set, the string characters are 2 bytes wide 
instead of 1 byte. 

o HeaderSize [2 bytes]: Total size in bytes of the header section. 

o Reserved [32 bytes]: Reserved spaces for future. 

o NumberOfSections [1 byte]: Number of sections in the index table. 

This determines the number of entries in the index table structure 

described below: 

Index Table Structure: (variable number of entries) 
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o SectlonType [1 bytes]: The type of data described in section. 0=file 
section, 1=variable section, 2=prefetch section, 3=profile section, 
4=:comment section, 5=code section. 

o SectionOffset [4 bytes]: The offset from the beginning of the file 
5 indicates the beginning of section. 

o SectionSize [4 bytes]: The size in bytes of section. 



Variable Structure: 

10 o ApplicationNamelsAnsi [1 byte]: 1 if ansi, 0 if Unicode. 

o ApplicationNameLength [4 bytes]: Byte size of the application 
name 

o ApplicationName [X bytes]: Null terminating name of the 
application 



2- File Section: 



in 

13 The file section 3102 contains a subset of the list of files needed by the 

^ application to run properly. This section does not enumerate files located 

in the standard application program directory. It consists of information 

20 about files copied into an 'unusual' directory during the installation of an 
application. If the file content is small (typically less than 1 MB), the file is 
copied to the client machine. Otherwise, the file is relocated to the 
standard program directory suitable for streaming. The file section data is 
a list of trees stored in a contiguous sequence of address spaces 

25 according to the pre-order traversal of the trees. A node in the tree can 
correspond to one or more levels of directories. A parent-child node pair 
is combined into a single node If the parent node has only a single child. 
Parsing the tree from the root of the tree to a leaf node results in a fully 
legal Windows pathname Including the drive letter. Each entry of the 

30 node in the tree consists of the following structure: 



Directory Structure: (variable number of entries) 
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o Flags [4 byte]: Bit 0 is set if this entry is a directory 
o NumberOfChildren [2 bytes]: Number of nodes in this directory 
o DirectoryNameLength [4 bytes]: Length of the directory name 
o DirectoryName [X bytes]: Null terminating directory name 

5 

Leaf Structure: (variable number of entries) 

Flags [4 byte]: Bit 1 is set to 1 if this entry is a spoof or copied file 
name 

FileVersion [8 bytes]: Version of the file GetFileVersionlnfo() if the 
file is Win32 file image. Need variable file version size returned by 
GetFileVersionlnfoSize(). OthenA/ise use file size or file modified 
time to compare which file is the later version. 
FileNameLength [4 bytes]: Byte size of the file name 
FileName [X bytes]: Null terminating file name 
DataLength [4 bytes]: Byte size of the data. If spoof file, then data 
is the string of the spoof directory. If copied file, then data is the 
content of the file 

Data [X bytes]: Either the spoof file name or the content of the 
copied file 

3. Add Variable and Remove Variable Sections: 

The add and remove variable sections 3103 contain the system 
variable changes needed to run the application. In a Windows 

25 system, each section consists of several number of registry subtrees. 

Each tree is stored in a contiguous sequence of address spaces 
according to the pre-order traversal of the tree. A node in the tree can 
correspond to one or more levels of directory in the registry. A parent- 
child node pair is combined into a single node if the parent node has 

30 only a single child. Parsing the tree from the root of the tree to a leaf 

node results In a fully legal key name. The order of the trees is shown 
here. 
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Registry Subsection: 

1. "HKCR": HKEY^CLASSES„ROOT 

2. "HKCU": HKEY^CURRENT^USER 

3. "HKLM": HKEY^LOCAL^MACHINE 

4. "HKUS*': HKEY^USERS 

5. "HKCC": HKEY^CURRENT_CONFIG 

Tree Structure: (5 entries) 

o ExistFlag [1 byte]: Set to 1 if this tree exist, 0 otherwise, 
o Key or Value Structure entries [X bytes]: Serialization of the tree 
into variable number key or value structures described below. 

Key Structure: (variable number of entries) 

o KeyFlag [1 byte]: Set to 1 if this entry is a key or 0 if if s a value 
structure 

o NumberOfSubchild [4 bytes]: Number of subkeys and values in 

this key directory 
o KeyNameLength [4 bytes]: Byte size of the key name 
o KeyName [X bytes]: Null terminating key name 

Value Structure: (variable number of entries) 

o KeyFlag [1 byte]: Set to 1 if this entry is a key or 0 if ifs a value 
structure 

o ValueType [4 byte]: Type of values from the Win32 API function 
RegQueryValueEx(): REG_SZ, REG^BINARY, REG^DWORD, 
REG^LINK, REG^NONE, etc... 
o ValueNameLength [4 bytes]: Byte size of the value name 
o ValueName [X bytes]: Null terminating value name 
o ValueDataLength [4 bytes]: Byte size of the value data 
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o ValueData [X bytes]: Value of the Data 



In addition to registry changes, an installation in a Windows system 
may involve changes to the ini files. The following structure is used to 
5 communicate the ini file changes needed to be done on the SAS 

client machine. The ini entries are appended to the end of the 
variable section after the five registry trees are enumerated. 



b. INI Subsection: 



'^^10 o NumFiles [4 bytes]: Number of INI files modified. 

File Structure: (variable number of entries) 
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M o FileNameLength [4 bytes]: Byte length of the file name 

k 

I 1 5 o FileName [X bytes]: Name of the INI file 

o NumSection [4 bytes]: Number of sections with the changes 



Section Structure: (variable number of entries) 



20 o SectionNameLength [4 bytes]: Byte length of the section name 

o SectionName (X bytes]: Section name of an INI file 
o NumValues [4 bytes]: Number of values in this section 



Value Structure: (variable number of entries) 

25 

o ValueLength [4 bytes]: Byte length of the value data 
o ValueData [X bytes]: Content of the value data 



4. Prefetch Section: 

30 The prefetch section 3104 contains a list of file blocks. The Builder 
profiler determines the set of file blocks critical for the initial run of the 
application. This data includes the code to start and terminate the 
application. |t includes the file blocks containing frequently used 
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commands. For example, opening and saving of documents are 
frequently used commands and should be prefetched if possible. 
Another type of block to include In the prefetch section are the blocks 
associated with frequently accessed directories and file metadata in this 
directory. The prefetch section is divided Into two subsections. One part 
contains the critical blocks that are used during startup of the streamed 
application. The second part consists of the blocks accessed for common 
user operations like opening and saving of document. The format of the 
data is described below: 

a. Critical Block Subsection: 
o NumCriticalBlocks [4 bytes]: Number of critical blocks. 
Block Structure: (variable number of entries) 

o FileNumber [4 bytes]: File Number of the file containing the block 
to prefetch 

o BlockNumber [4 bytes]: Block Number of the file block to prefetch 

b. Common Block Subsection: 
o NumCommonBlocks [4 bytes]: Number of critical blocks. 

25 Block Structure: (variable number of entries) 

o FileNumber [4 bytes]: File Number of the file containing the block 
to prefetch 

o BlockNumber [4 bytes]: Block Number of the file block to prefetch 

30 

5. Profile Section: 

The profile section 3105 consists of a reference sequence of file blocks 
accessed by the application at runtime. Conceptually, the profile data is 
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a two dimensional matrix. Each entry [row, column] of the matrix is the 
frequency, a block row is followed by a block column. In any realistic 
applications of fair size, this matrix is very large and sparse. The proper 
data structure must be selected to store this sparse matrix efficiently in 
required storage space and minimize the overhead in accessing this data 
structure access. 

The section is constructed from two basic structures: row and column 
structures. Each row structure is followed by N column structures 
specified in the NumberColumns field. Note that this is an optional 
section. But with appropriate profile data, the SAS client prefetcher 
performance can be increased. 

Row Structure: (variable number of entries) 

o FileNumber [4 bytes]: File Number of the row block 

o BlockNumber [4 bytes]: Block Number of the row block 

o NumberColumns [4 bytes]: number of blocks that follows this block. 

This field determines the number of column structures following 

this field. 

Column Structure: (variable number of entries) 

o FileNumber [4 bytes]: File Number of the column block 
o BlockNumber [4 bytes]: Block Number of the column block 
o Frequency [4 bytes]: frequency the row block is followed by column 
block 

6. Comment Section: 

The comment section 3106 is used by the Builder to describe this 
ApplnstallBlock in more detail. 
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o CommentLengthlsAnsi [1 byte]; 1 if string is ansi, 0 if Unicode 
format. 

o CommentLength [4 bytes]: Byte size of the comment string 
o Comment [X bytes]: Null terminating comment string 

5 

7. Code Section: 

The code section 3107 consists of the application-specific initialization 
code needed to run on the SAS client to setup the client machine for this 
particular application. This section may be empty if the default 
m 10 initialization procedure in the SAS client is able to setup the client 
^ machine without requiring any application-specific instructions. On the 

\i Windows system, the code is a DLL file containing two exported function 

Irt calls: InstallO, Uninstall(). The SAS client loads the DLL and Invokes the 

f appropriate function calls, 

M 15 

JS o CodeLength [4 bytes]: Byte size of the code 

Q o Code pc bytes]: Binary file containing the application-specific 

initialization code. On Windows, this is just a DLL file. 

20 8. LicenseAgreement Section: 

The Builder creates the license agreement section 3108. The SAS 
client displays the license agreement text to the end-user before the 
application is started for the first time. The end-user must agree to all 
licensing agreement set by the software vendor in order to use the 

25 application. 

o LicenseTextlsAnsI [1 byte]: 1 if ansi, 0 if Unicode format, 

o LicenseTextLength [4 bytes]: Byte size of the license text 

o LicenseAgreement \X bytes]: Null terminating license agreement 

30 string 

CLIENT INSTALLATION AND EXECUTION OF STREAMED APPLICATIONS 
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Summary 
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This section describes the process of installing and uninstalling streamed 
application on the client machine. With respect to Fig. 32, the lifecycie of the 
5 Application Install Block is shown. The Application Stream Builder 3202 
takes original application files 3201 and produces a corresponding 
Application Install Block and Stream Application Set 3203. These two files 
get installed onto the application servers 3206. On the right side of the 
drawing, it shows how either the administrator or the user can subscribe to 
10 the application from either the client computer 3208 or an administration 
computer 3207. Once the user logons onto the client computer 3208, the 
license and the AIB 3203 are acquired from the license 3205 and application 
servers 3206, respectively. 

15 The following are features of a preferred embodiment of the Invention: 

1. The streamed application installation process installs just the 
description of the application, not the total content of the application. 
After installing such description, the client system looks and feels 
20 similar to having installed the same app using a non-streamed 

method. This has the following benefits: 

a. Takes a very small fraction of the time to install the application. 

b. Takes a very small fraction of the disk space. 

c. Client does not have to wait for the entire application to be 
25 downloaded. This is particularly important to users with slow 

network connections. 



The application description Is subsequently un-installed without 
requiring deleting the total contents of the application. This has the 
30 benefit that it takes a very small fraction of the time to uninstall the 

application. 



2. Enhancing streamed application's performance by: 
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* a. Copying small portions of application's code and data (pages) 

that are critical to performance, 
b. Providing client with the initial profile data that can be used to 
perform pre-fetching. 

5 

This has the following benefits: 

1 . User experiences smooth and predictable application launch. 

2. Scalability of Application servers increases by reducing the 
U number of client connections. 



-jar. 
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y 3. An administrator can arrange applications to be installed 

iff 

automatically on client computers. Administrator can also arrange the 
:| installation on various client computers simultaneously without being 

^ physically present on each client computer. This has the following 

y, 15 benefits: 



20 



a. Users are not burdened with the process of installing streamed 
applications. 

b. Reduced administration expense. 



Overview of Components Relevant to the Install Process 

• Subscription Server 3204: allows users to create accounts & to rent. 

• License Server 3205: authenticates users & determines licensing 
rights to applications. 

25 • Application Server 3206: provides application bits to licensed users 
securely & efficiently. 

• Application Install Manager - a component installed on the streaming 
client that is responsible for installing and uninstalling streamed 
applications. 

30 • Application Install Block (AIB) 3203 - a representation of what gets 
installed on the client machine when a streamed application is 
installed. It contains portions of the application that are responsible for 
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registering the application with the client operating system and other 
data that enhances the execution of streamed application. 
Application Stream Builder 3202 - preprocesses apps & prepares 
files to be installed on Application Server and data, such as AIB, to be 
installed by Client Application Installer. 

Stream Application Set 3203 - a method of representing the total 
content of the application in a format that is optimal for streaming. 
Client Streaming File System - integrates client exec with paging from 
a special file system backed by remote network-accessed server- 



O 10 based store 

Application Install Block (AIB) 



Installing and un-installing a stream application requires an understanding 
^ 1 5 of what AIB Is and how it gets manipulated by the various components in the 



overall streaming system. AIB is physically represented as a data file with 
various different sections. Its contents include: 



o Streamed application name and identification number. 

20 o Software License Agreement, 

o Registry spoof set. 

o File spoof set. 

o Small number of application pages - initial cache contents, 

o Application Profile Data. 

25 

AIB Lifecycle 



The following describes the AIB lifecycle: 



30 1. Using the process described in the section above concerning 
converting apps for stream delivery and subsequent execution, an 
application install block is created by the Application Stream Builder. 
Initially, there will be one AIB per application, however, as the 
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application evolves via patches and service packs, new AlBs may 
need to be generated. 



2. Using a process described in the section above regarding server-side 
performance optimization, AIB will get hosted by the application 
servers. 

3. "Subscribing" the application by communicating with the subscription 
server. Subscribing to an application requires a valid account with the 
ASP. Either the user or an administrator acting on the user's behalf 
can subscribe the application. In addition, the application can be 
subscribed to from any computer on the Internet, not just the client 
machine where the application will be eventually installed. This 
allows an administrator to subscribe applications for a group of users 
without worrying about individual client machines. 

4. The client machine acquires the license for the application from the 
license server. If the application was subscribed from the client 
machine itself, this step will happen automatically after subscribing to 
the application. If the subscription happened from a different machine, 
e.g., the administrator's machine, this step will happen when the user 
logs on the client machine. As an acknowledgment of having a valid 
license, the license server gives the client an encrypted access token. 

5. Fetch the contents of AIB from the application server. This step is 
transparent and happens immediately after the preceding step. Since 
application server requires the client to possess a valid access token, 
it ensures that only subscribed and licensed users can install the 
streamed application. 

6. The Application Install Manager (AIM) performs the act of installing the 
application information, as specified by the AIB, on the client system. 
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Installing a Streamed Application 



AIM downloads AIB from the application server and takes the necessary 
steps in installing the application description on the client system. It extracts 
5 pieces of information from AIB and sends messages to various other 
components (described later) to perform the installation. AIM also creates an 
lnstall"Log that can be used when un-installing the streamed application. 

1 . Display a license agreement to the user and wait for the user to agree 
1 0 to it. 

CI 2. Extract File Spoof Data and communicate that to the Client File 

Spoofer. The list of files being spoofed will be recorded in the Install- 
Log. 

3. Extract Registry Spoof Data and communicate that to the Client 
15 Registry Spoofer. The list of Registries being spoofed will be recorded 

in the Install-Log. 

C5 4. Extract Initial Cache Content and communicate that to the Client 

Prefetch Unit. 

5. Extract Profile Data and communicate that to the Client Prefetch Unit. 
20 6. Save the Install-Log in persistent storage. 
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Un-lnstalling a Streamed Application 



Un-installation process relies on the Install-Log to know what specific items 
25 to un-install. Following steps are performed when un-installing and 
application: 



1 . Communicate with the Client Registry Spoofer to remove all registries 
being spoofed for the application being un-installed. 
30 2. Communicate with the Client File Spoofer to disable all files being 
spoofed for the application being un-installed. 

3. Communicate with the Client Prefetch Unit to remove all Profile Data 
for the application being un-installed. 
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4. Communicate with the Client Cache Manager to remove all pages 
being cached for the application being un-installed. 

5. Delete the Install-Log. 



5 Client File Spoofer 



A file spoofer component is installed on the client machine and is 
responsible for redirecting file accesses from a local file system to the 
streaming file system. The spoofer operates on a file spoof database that is 
0 stored persistently on the client system; it contains a number of file maps with 
3 following format: 

SI [Original path of a local file] <— > [New path of a file on streaming drive] 

: 15 Where "<-->" indicates a bi-directional mapping between the two sides of the 

hide 

M relationship shown. 



u When a streamed application is installed, the list of new files to spoof (found 
in AIB) is added to the file spoof database. Similarly, when a streamed 
20 application is un-installed, a list of files being spoofed for that application is 
removed from the file spoof database. 



On clients running the Windows 2000 Operating System, the file spoofer is a 
kernel-mode driver and the spoof database is stored in the registry. 

25 

Client Registry Spoofer 

The Registry Spoofer intercepts all registry calls being made on the client 
system and re-directs calls manipulating certain registries to an alternate 
30 path. Effectively, it is mapping the original registry to an alternate registry 
transparently. Similar to the client file spoofer, the registry spoofer operates 
on a registry spoof database consisting entries old/new registry paths. The 
database must be stored in persistent storage. 
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When a streamed application is installed, the list of new registries to spoof 
(found in AIB) is added to the registry spoof database. Upon un-installation 
of a streamed application, its list of spoofed registries is removed from the 
registry spoof database. 

On clients running the Windows 2000 Operating System, the registry spoofer 
is a kernel-mode driver and the registry spoof database is stored in the 
registry. 

Client Prefetch Unit 

In a streaming system, it is often a problem that the initial invocation of the 
application takes a lot of time because the necessary application pages are 
not present on the client system when neeeded. A key aspect of the client 
install is that by using a client prefetch unit, a system in accordance with the 
present invention significantly reduces the performance hit associated with 
fetching. The Client Prefetch Unit performs two main tasks: 

1. Populate Initial Cache Content, 

2. Prefetch Application Pages. 

Initial Cache Content 

The Application Stream Builder determines the set of pages critical for the 
initial invocation and packages them as part of the AIB. These pages, also 
known as initial cache content, include: 

o Pages required to start and stop the application, 
o Contents of frequently accessed directories, 
o Application pages performing some of the most common 
operations within application. For example, if Microsoft Word Is 
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being streamed, these operations include: opening & saving 
docunnent files & running a spell checker. 

When the Stream Application is installed on the client, these pages are put 
into the client cache; later, when the streamed application is invoked, these 
pages will be present locally and network latency Is avoided. 

In preparing the Prefetch data, It is critical to manage the trade off of how 
many pages to put into AlB and what potential benefits it brings to the initial 
application launch. The more pages that are put into prefetch data, the 
smoother the initial application launch will be; however, since the AlB will get 
bigger (as a result of packing more pages in it), users will have to wait longer 
when installing the streamed application. In a preferred embodiment of the 
invention, the size of the AlB Is limited to approximately 250KB. 

In an alternative embodiment of the Invention the AlB Initially includes only 
the page/file numbers and not the pages themselves. The client then goes 
through the page/file numbers and does paging requests to fetch the 
indicated pages from the server. 

Prefetch Application Pages 

When the streaming application executes, it will generate paging requests 
for pages that are not present in the client cache. The client cache manager 
must contact the application sender and request the page in question. The 
Invention takes advantage of this opportunity to also request additional 
pages that the application may need in the future. This not only reduces the 
number of connections to the application server, and overhead related to 
that, but also hides the latency of cache misses. 

The application installation process plays a role In the pre-fetching by 
communicating the profile data present in the AlB to the Client Prefetch Unit 
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when the application is installed. Upon un-instaltation, profile data for the 
particular application will be removed. 

CACHING OF STREAMED APPLICATION PAGES WITHIN THE NETWORK 
Summary 

This section describes how collaborative caching is employed to 
substantially improve the performance of a client server system in 
accordance with the other aspects of the present invention. Specifically, 
particular caching configurations and an intelligent way to combine these 
caching configurations are detailed. 

Collaborative Caching Features: 

• Using another client's cache to get required pages/packets (Peer 
Caching) 

• Using an intermediate proxy or node to get required pages/packets 
(Proxy Caching) 

• Using a broadcasting or multicasting mechanism to make a request 
(Multicast) 

• Using a packet based protocol to send requested pages/packets 
rather than a stream based one. (Packet Protocol) 

• Using concurrency to request a page through all three means (Peer 
Caching or Proxy Caching or the actual server) to improve 
performance (Concurrent Requesting). 

• Using heuristical algorithms to use all three ways to get the required 
pages (Smart Requesting). 

These features have the following advantages: 
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• These ideas potentially improve the performance of the client, i.e., 
they reduce the time a client takes to download a page (Client 
Performance). 

• These ideas improve the scalability of the server because the server 
5 gets fewer requests, i.e., requests which are fulfilled by a peer or a 

proxy don't get sent to the server. (Server Scalability) 

• These allow a local caching mechanism without needing any kind of 
modification of local proxy nodes or routers or even the servers. The 
peer-to-peer caching is achieved solely through the co-operation of 

10 two clients. (Client Only Implementation) 

• These Ideas allow a client to potentially operate "offline" i.e., when it is 
not getting any responses from the server (Offline Client Operation). 

• These ideas allow the existing network bandwidth to be used more 
effectively and potentially reduce the dependency of applications on 

15 higher bandwidth (Optimal Use of Bandwidth). 

• These ideas when used in an appropriate configuration allow each 
client to require a smaller local cache but without substantially 
sacrificing the performance that you get by local caching. An example 
is when each client "specializes" in caching pages of a certain kind, 

20 e.g., a certain application. (Smaller Local Cache). 

These ideas involve new interrelationships - peer-to-peer communication 
for cache accesses; or new configurations - collaborative caching. The 
reason this is called collaborative is because a group of clients can 
25 collaborate in caching pages that each of them needs. 

Aspects of Collaborative Caching 

1 . Peer Caching: A client X getting its pages from another client Ys local 
30 cache rather than its (X's) own or from the server seems to be a new 

idea. Major advantages: client performance, server scalability, client 
only implementation, offline client operation, optimal use of 
bandwidth, smaller local cache. 
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2. Proxy Caching; The client getting its pages from an intermediate proxy 
which either serves the page from the local cache or from another 
intermediate proxy or the remote server (if none of the intermediate 
proxies has the page) is unique, at a minimum, for the pages of a 
streamed application. Major advantages: client performance, server 
scalability, offline client operation (to some extent), optimal use of 
bandwidth, smaller local cache. 

3. Multicast: Using multicasting (or selective broadcasting) considerably 
reduces peer-to-peer communication. For every cache request there 
is only one packet on the network and for every cache response there 
is potentially only one packet on the network in some configurations. 
This definitely helps reduce network congestion. Major advantages: 
client performance, server scalability, client only implementation, 
offline client operation, optimal use of bandwidth 

4. Packet Protocol: Because only datagram packets are used to request 
or respond to cache pages this saves the overhead of opening 
stream-based connections such as a TCP connection or an HTTP 
connection. Major advantages: client performance, client only 
implementation, offline client operation, and optimal use of bandwidth. 

5. Concurrent Requesting: If concurrent or intelligently staggered 
requests through all three means are issued to request a single page, 
the client will be able to receive the page through the fastest means 
possible for that particular situation. Major advantages: client 
performance, server scalability, offline client operation, and optimal 
use of bandwidth 

6. Smart Requesting: An adaptive or "smarf' algorithm can be used to 
further enhance the overall performance of the client-server system. In 
this algorithm, the client uses the data of how past requests were 
processed to "tune" new requests. For example, if the client's past 
requests were predominantly served by another client, Le,, Peer 
Caching worked, then for new page requests the client would first try 
to use Peer Caching, and wait some time before resorting to either 
Proxy Caching or direct server request. This wait time can again be 
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calculated in an adaptive fashion. Major advantages: client 
performance, server scalability, client only implementation, offline 
client operation, and optimal use of bandwidth. 



5 The concepts illustrated herein can be applied to many different problem 
areas. In all client-server implementations where a server is serving requests 
for static data, e.g., code pages of a streamed application or static HTML 
pages from a Website, the approaches taught herein can be applied to 
improve the overall client-server performance. Even if some of the protocols 
10 or configurations described in this document are not supported by the 
^ underlying network, it does not preclude the application of other ideas 

III described herein that do not depend on such features. For example, if 

multicast (or selective broadcast) is not supported, ideas such as Concurrent 
Requesting or Smart Requesting can still be used with respect to multiple 
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U 1 5 servers instead of the combination of a server, peer, and proxy. Also the use 
h of words like Multicast does not restrict the application of these ideas to 



multicast based protocols. These Ideas can be used in all those cases where 
a multicast like mechanism, Le,, selective broadcasting is available. Also 
note that the description of these Ideas in the context of LAN or Intranet 

20 environment does not restrict their application to such environments. The 
ideas described here are applicable to any environment where peers and 
proxies, because of their network proximity, offer significant performance 
advantages by using Peer Caching or Proxy Caching over a simple client- 
server network communication. In that respect, the term LAN or local area 

25 network should be understood to mean more generally as a collection of 
nodes that can communicate with each other faster than with a node outside 
of that collection. No geographical or physical locality is implied in the use of 
the term local area network or LAN. 

30 Peer Caching 

Referring to Fig. 33, how multiple peers collaborate in caching pages that 
are required by some or all of them is shown. 
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The main elements shown are: 

• Client 1 3301 through Client 6 3306 in an Ethernet LAN 3310. 

• Router 1 and the local proxy serving as the Internet gateway 3307, 
5 Note that it does not matter whether Router 1 and the proxy are one 

computer or two different ones. 

• Other routers from router 2 through router N 3308 that are needed to 
connect the LAN 3310 to the Internet 331 1. 

p • A remote server 3309 (that is reachable only by going over the 

?S 10 Internet 3311) that is serving the pages that the above mentioned 

O clients need. 

in 

SI • A cloud that symbolizes the complexity of the Internet 3311 and 

m 

y§ potentially long paths taken by packets. 

» 

1-^ 15 Client 2 3302 needs a page that it does not find in its local cache. It then 

25 decides to use the mechanism of Peer Caching before attempting to get the 

P page from the local proxy (or the actual server through the proxy). The actual 
sequence of events is as follows: 

20 1. Client 2 3302 sends a request for the page it needs. This 

request is sent as a multicast packet to a predetermined 
multicast address and port combination. Lets call this multicast 
address and port combination as M. 

2. The multicast packet is received by all the clients that have 
25 joined the group M. In this case all six clients have joined the 

group M. 

3. Client 5 3305 receives the request and it records the sender's, 
i.e., Client 2's 3302, address and port combination. Let's 
assume this address and port combination is A. Client 5 3305 

30 processes the request and looks up the requested page in its 

own cache. It finds the page. 

4. Client 5 3305 sends the page to address A (which belongs to 
Client 2 3302) as a packet. 
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5. Client 2 3302 receives tlie page it needs and hence does not 
need to request the server for the page. 

Proxy Caching 

With respect to Fig. 43, a transparent proxy and how clients use it to get 
pages Is shown. Again the elements here are the same as in the previous 
figure: 

• Client 1 3401 through Client 6 3406 in an Ethernet LAN 3410. 

• Router 1 and the local proxy serving as the Internet gateway 3407. 
Note that it does not matter whether Router 1 and the proxy are one 
computer or two different ones. 

• Other routers from router 2 through router N 3408 that are needed to 
connect the LAN 3410 to the Internet 341 1 . 

• A remote server 3409 (that is reachable only by going over the 
Internet 3411) that is serving the pages that the above mentioned 
clients need. 

• A cloud that symbolizes the complexity of the Internet 3411 and 
potentially long paths taken by packets. 

Assume Peer Caching is either not enabled or did not work for this case. 
When Client 2 3402 needs a page, it makes a request to the proxy 3407. The 
proxy 3407 finds the page in its local cache and returns It to Client 2 3402. 
Because of this, the request did not go to the remote server 3409 over the 
Internet 3411. 

Multicast and Packet Protocol within a LAN 

Referring to Fig. 35, the role played by multicast and unicast packets in Peer 
Caching is shown. The example of the drawing "Peer Caching" is used to 
explain multicast. Here Client 2 3502 has the IP address 10.0.0.2 and it 
opens port 3002 for sending and receiving packets. When Client 2 3502 

97 



10 



20 



needs a page and wants to use Peer Caching to get it, it forms a request and 
sends it to the multicast address and port 239.0.0.1 :2001 . All the other clients 
in the LAN 3508 that support Peer Caching have already joined the group 
239.0.0.1:2001 so they all receive this packet. 

Client 5 3505 receives this packet and it records the sender address 
(10.0.0.2:3002 in this case). It looks up the requested page and finds it in its 
local cache, ft sends the page as a response packet to the address 
10.0.0.2:3002. 



O Client 2 3502 receives this response packet since it was waiting at this port 

m after sending the original multicast request. After ensuring the validity of the 

response, it retrieves the page it needs. 



1 5 Note that more than one client can respond to the original multicast request. 
f!f; However Client 2 3502 can discard all the later responses, since it has 

already received the page it needed. 



Concurrent Requesting - Proxy First 



With respect to Fig. 36, one particular case of how Concurrent Requesting is 
used is shown. This is a timeline of events that take place in the client. When 
a client first needs a page, it does not know whether it is going to get any 
responses through Peer Caching or not. Hence it issues a request to the 

25 proxy (or the server through the proxy) as soon as it needs the page. Then it 
issues a request using the Peer Caching mechanism. If there is indeed a 
peer that can return the page requested, the peer presumably could return 
the page faster than the proxy or the server. If this happens, the client may 
decide to use Peer Caching mechanism before attempting to get the page 

30 from the proxy or the server. The timeline essentially describes the following 
sequence of events: 

1 . At time t = 0, a page p is needed by the client 3601 . 
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2. The client looks up its local cache, and it doesn't find page p. 

3. At time t = T1 , it decides to send a request to the proxy to get the page 
3602. 

4. After a delay of amount Dp 3603, at time t = T2 it also sends a request 
5 for the page p through the mechanism of Peer Caching 3604. Note 

that Dp 3603 can be zero, in which case T1 = T2. 

5. At time t = T3, a response is received from another peer that contains 
the page p that this client needs 3606. Thus the response time of the 
Peer Caching mechanism is Rp = T3 - T2 3605. 

10 6. At time t = T4, a response from the proxy/server is received that 
Q contains the page p 3608. Hence the response time of the 

J proxy/server is Rg = T4 - T1 3607. 



# Note that since, Rp < Rs, the client will increase the weighting for Peer 

15 Caching in all of its future queries. That means It will decrease Dp, and if Dp 
S is already zero, it will increase Dg (the delay before requesting proxy/server). 

On the other hand, if Rp > Rs or if Rp were infinity, it will increase its weighting 
for proxy/server requesting. This is part of Smart Requesting that is 
explained elsewhere in this document. 
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Concurrent Requesting - Peer Caching First 



Referring to Fig. 37, in contrast to the previous figure, the client has decided 
to use Peer Caching before requesting the proxy. So the sequence of events 
25 is as follows: 



1 . At time t = 0, a page p is needed by the client 3701 . 

2. The client looks up its local cache, and it doesn't find page p. 

3. At time t = T5, it decides to send a request for the page p through the 
30 mechanism of Peer Caching 3702. 

4. After a delay of amount Dg 3703, at time t = T6 it also sends a request 
for the page p to the proxy/server. Note that Ds can be zero, in which 
case T5 = T6. 
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5. At time t = T7, a response is received from another peer that contains 
the page p that this client needs 3706. Thus the response time of the 
Peer Caching mechanism is Rp = T7 - T5 3705. 

6. At time t = T8, a response from the proxy/server is received that 
5 contains the page p 3708. Hence the response time of the 

proxy/server is Rg = T8 - T6 3707. 

As described in the previous drawing, the client increases the weighting of 
1,^ Peer Caching even more because it got a response through Peer Caching 

10 long before it got a response from the proxy/server. As a result of the 
increases weighting the delay is increased even more. 



M 



m 



d1 
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Concurrent Requesting - Peer Caching Only 



15 With respect to Fig. 38, in contrast with Fig. 37, the client has increased Dg 
Q 3805 (the delay before requesting a proxy/server) so much, that if a page is 

CS received before the expiry of the delay Dg 3805, the client does not even 

make a request to the proxy/server. The shaded area 3806 shows the events 



that do not take place because of this. 
Client-Server System with Peer and Proxy Caching 



Referring to Fig. 39, a system level drawing that gives a system context for all 
the other figures and discussion in this document is shown. This drawing 
25 illustrates all three ways in which a client gets its page requests fulfilled. 
Note that: 

• Client 2 3902 gets its page request fulfilled through Peer Caching, i.e., 
multicast request. 

30 • Client 1 3901 gets Its page request fulfilled through Proxy Caching, 
/.a, the proxy 3907 finds the page in its cache and returns it. 

• Client 3 3903 has to go to the server 3909 over the Internet 3908 to 
get its page request fulfilled. 
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Collaborative Caching Details 

In a typical client-server model, caching could be used to Improve the 
performance of clients and scalability of servers. This caching could be: 

• Local to the client where the client itself locally stores the pages it had 
received from the server in the past. Then the client would not need to 
request the proxy/server for any page that resides In the local cache 
as long as the locally cached page is "valid" from the server point of 
view. 

• On a proxy node that can be any node along the path taken by a 
packet that goes from the client to the server. The closer this proxy 
node is to the client the more improvement in the performance you 
get. 

• On a peer node, that is on another client. In this case, the two clients 
(the requesting client as well as the serving client) are on the same 
LAN or intranet, so that the travel time of a packet between the two 
nodes is considerably smaller as compared to the travel time of the 
packet from one of the clients to the server. 

As far as caching is concerned, this section details the new ideas of Peer 
Caching and Proxy Caching. In addition, It also details the new ideas of 
Concurrent Requesting and Smart Requesting. The preferred approaches 
for implementing these ideas are also described here and these are 
Multicast and Packet Protocol. 

The idea of Peer Caching is nothing but a client X taking advantage of the 
fact that a peer, agr., say another client Y, on its LAN had, in the past, 
requested a page that X is going to request from its server. If the peer Y has 
that page cached locally on its machine, then X could theoretically get it 
much faster from Y than getting it from the server itself. If an efficient 
mechanism is provided for the two clients X and Y to collaborate on this kind 
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of cache access, then that will offer many advantages such as: Client 
Performance, Server Scalability, Client Only Implementation, Offline Client 
Operation, Optimal Use of Bandwidth, Smaller Local Cache. Note that two 
clients were considered only as an example, the idea of Peer Caching Is 
5 applicable to any number of peers on a LAN. 

The idea of Multicast is to use the multicast protocol in the client making a 
Peer Caching request. Multicast can be briefly described as "selective 
broadcasting" ~ similar to radio. A radio transmitter transmits "information'' on 
10 a chosen frequency, and any receiver (reachable by the transmitter, of 
course) can receive that information by tuning to that frequency. In the realm 
of multicast, the equivalent of a radio frequency is a multicast or class D IP 
address and port. Any node on the net can send datagram packets to a 
multicast IP address+port. Another node on the net can "join" that IP 
Q 15 address+port (which is analogous to tuning to a radio frequency), and 
2 receive those packets. That node can also "leave" the IP address+port and 
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O thereby stop receiving multicast packets on that IP address+port. 

u 

Note that multicast is based on IP (Internet Protocol) and is vendor neutral. 
20 Also, it is typically available on the Ethemet LAN and, if routers supported it, 
it can also go beyond the LAN. If all the routers involved in a node's 
connection to the Internet backbone supported multicast routing, multicast 
packets theoretically could go to the whole Internet except the parts of the 
Internet that do not support multicast routing. 

25 

The use of multicast allows a client to not have to maintain a directory of 
peers that can serve its page requests. Also because of multicast there is 
only one packet per page request. Any peer that receives the request could 
potentially serve that request, so by using a multicast based request there 
30 are multiple potential servers created for a page request but only one 
physical packet on the network. This contributes substantially in reducing 
network bandwidth, but at the same time increasing peer accessibility to all 
the peers. When implemented properly, the packet traffic due to Peer 
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Caching will be proportional to the number of clients on the network 
participating in Peer Caching. 



An idea related to Multicast is Packet Protocol. Note that IVlulticast itself is a 
5 packet-based protocol as opposed to connection based. The idea of Peer 
Caching here is described using Multicast and Packet Protocol. The Peer 
Caching request is sent as a multicast request and the response from a peer 
to such a request is also sent as a packet (not necessarily a multicast 
packet). Sending packets is much faster than sending data through a 
10 connection-based protocol such as TCP/IP, although using packet-based 

protocol is not as reliable as using connection-based one. The lack of 

111 

sj reliability in Packet Protocol is acceptable since Peer Caching is used only 

^ to improve overall performance of the Clien^Server system rather than as a 

s primary mechanism for a client to get its pages. The underlying assumption 

rr 15 made here is that a client could always get its pages from the sen/er, if Peer 

Q Caching or Proxy Caching does not work for any reason. 



The ideas of Concurrent Requesting and Smart Requesting describe how 
Peer Caching, Proxy Caching and client-server access could be combined 

20 in an intelligent fashion to achieve optimal performance of the whole Client- 
Server system. As part of Concurrent Requesting, a client is always prepared 
to make concurrent requests to get the page it needs in the fastest way 
possible. Concurrent Requesting would require the use of objects such as 
threads or processes that would allow one to programmatically implement 

25 Concurrent Programming. This document assumes the use of threads to 
describe a possible and preferred way to implement Concurrent Requesting. 



The idea of Smart Requesting includes using an adaptive algorithm to 
intelligently stagger or schedule requests so that a client, even while using 
30 Concurrent Requesting, would not unnecessarily attempt to get a page 
through more than one means. An example of this is when a client has 
consistently gotten Its page requests fulfilled through Peer Caching in the 
past. It would come to depend on Peer Caching for future page requests 
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more than the other possible means. On the other hand, if Peer Caching has 
not worked for that client for some time, it would schedule a proxy request 
before a Peer Caching request. Smart Requesting involves dynamically 
calculating the delays Dp and Ds based how well Peer Caching and Proxy 
5 Caching has worked for the client. Please see Figs. 36 through 38. 

The following is an algorithmic description using pseudo-code of an 
illustrative embodiment. 

startOurClient is a function that is invoked initially when the client is started. 

void startOurClientO { 

Initialize the global variable delay to appropriate value based on a 
predfined policy. When delay is positive, it signifies the amount of time 
to wait after Proxy Caching before Peer Caching is attempted; and 
when delay is negative it signifies the amount of time to wait after Peer 
Caching before Proxy Caching is attempted. As an example: 

delay = 50; 

Start a thread for peer responses (/.e.. Peer Caching server) with 
thread function as peerServer; 

} 

25 getPage function 

The function getPage is called by the client's application to get a page. This 
function looks up the local cache and if the page is not found, attempts to get 
the page from a peer or proxy/server using the ideas of Concurrent 
30 Requesting and Smart Requesting. 

void getPage(PageldType pageld) { 

if pageld present in the local cache then { 



10 



h4k 



15 



104 



retrieve it and return it to the caller; 

} 

If (delay > 0) { 

myDelay = delay; 
5 Call requestProxy(pageld); 

} 

else { 

myDelay = -delay; 

Call requestPeer(pageld); 

S 10 } 

Wait for gotPage event to be signaled for a maximum of myDelay 
milliseconds; 

If the page was obtained as indicated by gotPage being signaled { 
s Modify delay appropriately i.e., if the page was obtained 

15 through Proxy Caching Increment delay else decrement it; 

Return the page; 

} 

if (delay > 0) { 

Call requestPeer(pageld); 

20 } 

else { 

Call requestProxy(pageld); 

} 

Wait for the page to come through either methods; 
25 Depending on how the page came (through Proxy Caching or Peer 

Caching) increment or decrement delay; 
Return the page; 
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30 requestProxy function 



The function requestProxy sends a page request to the proxy and starts a 
thread that waits for the page response (or times out). The function 
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proxy Response is the thread function that waits for the response based on 
the arguments passed to it. 



void requestProxy(pageld) { 
5 Send a page request for pageld to a predefined proxy/server as per 

the proxy/server protocol; 

Start a thread with the thread function proxyResponse that waits for 
the response to the request - the function proxyResponse is passed 
arguments: the socket X where it should wait and pageld. 

m 10 } 

S 
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void proxyResponse(socket X, pageld) { 

Wait at the socket X for a response with a timeout of time TY; 
If a response was received at socket X { 
^ 1 5 Uncompress the packet if necessary; 

Validate the packet and ensure that this is a 
valid response to the request and has the page requested {Le., 
match the pageld); 

} 

20 else { 

// this is time out: didn't receive any 
// response in time TY 

Set appropriate indicator to indicate time-out; 

} 

25 Signal an event to signify completion of this thread; 

} 

requestPeer and peerResponse functions 



The function requestPeer is similar to requestProxy except that it sends a 
30 page request to peers and starts a thread that waits for the page response 
(or times out). The function peerResponse is the thread function that waits for 
the response based on the arguments passed to it. 
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Void requestPeer(pagelcl) { 

Create a UDP socket X bound to port 3002; 
Compose a packet that consists of: 

• a code indicating that this is a request for a page 

5 • Some kind of an identifier that uniquely identifies the page 

wanted such as the URL 

• other info such as security information or access validators 
Send this packet as a multicast packet to 239.0.0.1:2001 through the 
socket X created above; 

0 

Create a thread with the thread function peerResponse and pass 
socket X and pageld as arguments to it; 



15 Void peerResponse(socket X, pageld) { 

Wait at the socket X for a response with a timeout of time TX; 
If a packet was received at socket X { 

Uncompress the packet if necessary; 
Validate the packet and ensure that this is a 
20 valid response to the request and has the page requested (/.e., 

match the pageld); 

} 

else { 

II this is time out: didn't receive any 
25 // response in time TX 

Set appropriate indicator to indicate time-out; 

} 

Signal an event to signify completion of this thread; 

} 

30 

peerServer function 
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The function peerServer described below serves page requests received 
through Peer Caching as multicast packets. The function below describes 
how this thread would work: 

5 void peerServerQ { 

Create a nnultlcast socket M bound to port 2001 ; 
Have M "join" the IP address 239.0.0.1; 
while (not asked to terminate) { 

Wait at M for a multicast packet; 
10 If a packet is received then { 

Store the source IP addr in S along with the source port number in B; 
Validate the packet that it is a valid request for a page that can be 
served (with valid security credentials); 

Look up the page id in the local client cache; 
15 If the page Is found { 

Compose a packet that contains the pageld of the 
page as well as the page contents to send; 
Optionally compress the packet before sending; 
Send this packet to the IP address S at port B; 

20 } 
} 



25 PIRACY PREVENTION FOR STREAMED APPLICATIONS 

Summary 

The details presented in this section describe new techniques of the 
30 invention that have been developed to combat software piracy of 
applications provided over networks, In situations where an ASP's clients' 
machines execute the software applications locally. The remote ASP server 
must make all the files that constitute an application available to any 
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subscribed user, because it cannot predict with complete accuracy which 
files are needed at what point in time. Nor is there a reliable and secure 
method by which the server can be aware of certain information local to the 
client computer that could be useful at stopping piracy. The process may be 
a rogue process intent on pirating the data, or it may be a secure process run 
from an executable provided by the ASP. 

Aspects of the Invention 

1. Client-side fine-grained filtering of file accesses directed at remotely 
served files, for anti-piracy purposes. Traditional network filesystems 
permit or deny file access at the server side, not the client side. Here, 
the server provides blanket access to a given user to all the files that 
the user may need during the execution of an application, and makes 
more intelligent decisions about which accesses to permit or deny. 

2. Filtering of file accesses based on where the code for the process that 
originated the request is stored. Traditional file systems permit or 
deny file access usually based on the credentials of a user account or 
process token, not on where the code for the process resides. Here, a 
filesystem may want to take into account whether the code for the 
originating process resides in secure remote location or an insecure 
local location. 

3. Identification of crucial portions of served files and filtering file 
accesses depending on the portion targeted. The smallest level of 
granularity that traditional file systems can operate on Is at the level of 
files, not at the level of the sections contained in the files (for example, 
whether or not data from a code section or a resource section is 
requested). 

4. Filtering of file accesses based on the surmised purpose of the file 
access, as determined by examining the program stack or flags 
associated with the request. Traditional file systems do not attempt to 
determine why a file access was issued before permitting or denying 
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the access, e.g., whether the purpose is to copy the data or page in 
the data as code for execution. 
5. Filtering of file accesses based on the surmised purpose of the file 
access, as determined by examining a history of previous file 
accesses by the same process. Traditional file systems do not keep 
around histories of which blocks a given requestor had previously 
requested from a file. This history can be useful in seeing if the 
requests match a pattern that suggests a file copy is occurring as 
opposed to code execution. 

Benefits of the Anti-Piracy Features of the Present Invention 

This is an enabler technology that allows a programmer to build security into 
a certain type of application delivery system that would otherwise not be 
possible. Several companies are developing technology that allows an 
application to be served remotely, but executed locally. Current filesystems 
provide no way to protect the files that make up this application from being 
copied and thus pirated. The above techniques are tools that enable a 
filesystem to allow just those requests that will let the application run 
normally and block those that are the result of attempts to pirate the 
application's code or data. This provides a competitive advantage to those 
software providers who use this technology, because piracy results in lost 
revenue and, by preventing this, piracy they can prevent this loss. 

The techniques described herein were developed for the purpose of 
preventing the piracy of computer software programs that are served from a 
remote server, but executed on a local client. However, they can be used by 
any computer software security solution that would benefit from the ability to 
filter file accesses with more flexibility than currently provided by most 
filesystems. 

When a filesystem receives a request, It must decide whether or not the 
request should be granted or denied for security reasons. If the target file is 
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local, the filesystem makes the decision by itself, and if the target file is 
remote, it must ask the server to handle the request for it. The above 
techniques are ways in which the filesystem can gather more information 
about the request than it would ordinarily have. It can then use that 
5 information to improve the quality of its decisions. Traditional approaches, 
such as granting a currently logged-in user access to certain files and 
directories that are marked with his credentials, are not flexible enough for 
many situations. As for remote files, the server has only a limited amount of 
information about the client machine. The filesystem at the client side can 
^ 10 make grant/deny decisions based on local information before ever asking 
Q the server, in order to provide a more intelligent layer of security. 

^ For example, it may be desirable to allow the user to execute these files, but 

not copy them. It may be desirable to grant access to only certain processes 

U 15 run by the user, but not others, because it is judged that some processes to 
be more secure or well-behaved than others. And it may be desirable to 
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allow the user to access only certain sections of these files and from only 
certain processes for certain periods of time. The above techniques are tools 
that are added to a filesystem to give it these abilities. 

Overview of the Anti-Piracy Features of the Present Invention 



With respect to Fig. 40, preventing piracy of remotely served, locally 
executed applications is shown. This figure illustrates the problem of 

25 software piracy in an application delivery system, and how it can be stopped 
using the techniques described in this section. The client computer 4001 is 
connected to a server 4009 run by an ASP 4007. The server 4009 provides 
access to application files 4008, out of which the application executable is 
run by the client 4001 locally on his machine. (This is Process #1 4002). 

30 However, the user can attempt to access and copy the application files to 
local storage 4009 on his machine, and thus be able to run them without 
authorization or give them to another person. But since all requests directed 
at the remote files 4006 must first pass through the local network filesystem. 
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this filesystem can be enhanced 4005 to deny all such requests that it thinics 
are the result of an attempt at piracy. 
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Referring to Fig. 41, the filtering of accesses to remote application files, 
5 illustrating New Technique #1, as described above is shown. (Note; the 
client computer represented here and in all subsequent figures is part of the 
same client-server system as in Fig. 40, but the server/ASP diagram has 
been omitted to save space.) A user 4102 who has been granted access to 
remotely served files 4106 representing an application is attempting to 
10 access these files. The local enhanced network filesystem 4103 is able to 
y deny access to certain files 4105 and grant access to others 4104, for the 

U'l purpose of protecting critical parts of the application from piracy. 

M 

fti 

^ With respect to Fig. 42, the filtering of accesses to remote files based on 

15 process code location, illustrating New Technique #2, as described above, is 
shown. Here there are two processes on the client computer. Process #1 
4202 has been run from an executable file 4206 that is part of a remotely 
H served application 4207, and process #2 4203 has been run from a local 

executable file 4204. They are both attempting to access a remote data file 
20 4206 that is part of the served application 4207. The local enhanced network 
filesystem 4205 is denying Process #2 4203 access and granting Process 
#1 4202 access because Process #2's 4203 executable is stored locally, 
and thus is not secure, while Process #Vs 4202 executable is provided by 
the server 4207, and thus can be vouched for. 

25 

Referring to Fig. 43, the filtering of accesses to remote files based on 
targeted file section, illustrating New Technique #3, as described above, is 
shown. Here there is a single local process 4302 that is attempting to read 
from a remotely served executable file 4307. The enhanced network 
30 filesystem 4304 is denying an attempt to read from the code section 4306 of 
the file 4307 while granting an attempt to read from a non-code section 4305 
of the file 4307. This is useful when access to some part of the file must be 
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allowed, but access to other parts should be denied to prevent piracy of the 
entire file. 



With respect to Fig. 44, the filtering of accesses to remote files based on 
5 surmised purpose, illustrating New Technique #4 as described above, is 
shown. Here, two attempts to read from the code section 4407 of a remote 
executable file 4408 are being made from a process 4402 that was run from 
this file 4408. However, one request is denied because it originated 4406 
from the process's code 4403 itself, while another is approved because it 
p 10 originated from code in the Virtual Memory Subsystem 4404. This prevents 
^ even a rogue remote process from attempting to pirate its own code, while 

allowing legitimate requests for the code to be completed. 
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Referring to Fig. 45, the filtering of accesses to remote files based on past 
1^^ 15 access history, illustrating New Technique #5 as described above, is shown, 
p Here, two processes 4502, 4503 run from a local executable 4504 are 

attempting to access a remote file 4508. The enhanced network filesystem 
4507 keeps around a history of previous file accesses by these processes 
4505, 4506, which it consults to make decisions about permitting/denying 
20 further accesses. Process #1's 4502 access attempt is granted, while 
Process #2's 4503 is denied, because the filesystem 4507 detected a 
suspicious pattern in Process #2's 4503 previous access history 4506. 



Anti-Piracy Details of the Invention 

25 

Five anti-piracy embodiments are disclosed below that can be used by an 
ASP-installed network filesystem to combat piracy of remotely served 
applications. The ASP installs a software component on the client that is 
able to take advantage of local knowledge, eg., which process on the client 
30 originated a request for data, and permit or deny requests for remote files 
before sending the requests to the server. That is, a network filesystem Is 
installed on the local user's computer that manages access to these remote 
files. All input/output requests to these files must pass through this filesystem, 
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and if the filesystem determines that a given request is suspicious in some 
way, it has the freedom to deny it. 



Anti-Piracy Embodiment #1 : Client-side fine-grained filtering of file accesses 
5 directed at remotely served files, for anti-piracy purposes. 



Referring again to Fig. 41 , the approach of the first anti-piracy embodiment is 
that a software component 4102 executing locally on a client computer 4101 
has available to it much more information about the state of this computer 
h| 10 than does a server providing access to remote files. Thus, the server can 
filter access only on a much coarser level that can this client component. An 
ASP can take advantage of this by installing a network filesystem 4103 on 
the client computer that is designated to handle and forward all requests 
directed at files located on a given remote server. This filesystem 4103 
15 examines each request, and either grants or denies it depending on whether 
Q the request is justifiable from a security perspective. It can use information 

such as the nature of the originating process, the history of previous access 
M* by the process, the section of the targeted file being requested, and so on, in 

order to make its decision. 

20 

The best way known of implementing this approach is to write a network 
redirector filesystem component 4103 for the operating system that the 
ASP's clients' machines will be running. This component will be installed, 
and will make visible to the system a path that represents the server on 

25 which the ASP's application files are stored. The local computer can now 
begin accessing these files, and the filesystem 4103 will be asked to handle 
requests for these files. On most operating systems, the filesystem 4103 will 
register dispatch routines to the system that handle common file operations 
such as open, read, write and close. When a local process 4102 makes a 

30 request of an ASP-served file, the OS calls one of these dispatch routines 
with the request. In the dispatch routine, the filesystem 4103 examines the 
request and decides whether to deny it or grant it. If granted, it will forward 
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the request to the remote server and send back the response to the 
operating system. 



Anti-Piracy Embodiment #2: Filtering of file accesses based on where the 
5 code for the process that originated the request is stored. 

Referring again to Fig. 42, when a filesystem 4205 receives a request for 
access to a given file, the request always originates from a given process on 
the computer. By determining where the executable file that the process was 
Q 10 run from is located, the network filesystem 4205 can make a more informed 
decision about the security risk associated with granting the request. For 
example, if the executable file 4204 is located on the local computer 4202, 
then it may contain any code whatsoever, code that may attempt to copy and 



ss store the contents of any remote files it can gain access to. The filesystem 

15 4205 can reject requests from these processes as being too risky. However, 

3 if the executable file 4206 is being served by the ASP's remote server 4207, 

ill 

Q then the process can assume to be well-behaved, since it is under the 

control of the ASP. The filesystem 4205 can grant accesses that come from 
these processes 4202 in confidence that the security risks are minimal. 

20 

The best way known of implementing this approach is to modify a network 
filesystem 4205 to determine the identity of the process that originated a 
relevant open, read, or write request for a remote file. On some OSes a 
unique process ID is embedded in the request, and on others, a system call 

25 can be made to get this ID. Then, this ID must be used to look up the 
pathname of the executable file from which the process was run. To do this, 
upon initialization the filesystem 4205 must have registered a callback that is 
invoked whenever a new process is created. When this callback is invoked, 
the pathname to the process executable and the new process ID are 

30 provided as arguments, data which the filesystem 4205 then stores in a data 
structure. This data structure Is consulted while servicing a file request, in 
order to match the process ID that originated the request with the process's 
executable. Then the root of the pathname of that executable is extracted. 
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The root uniquely identifies the storage device or remote server that provides 
the file. If the root specifies an ASP server that is known to be secure, as 
opposed to a local storage device that is insecure, then the request can be 
safely granted. 

5 

Anti-Piracy Embodiment #3: Identification of crucial portions of served files 
and filtering file access depending on the portion targeted. 

Referring again to Fig. 43, a served application usually consists of many 
Q 10 files. In order to steal the application, a pirate would have to copy at least 
those files that store the code for the application's primary executable, and 
perhaps other files as well. This leads to the conclusion that some files are 
more important that others, and that some portions of some files are most 
important of all. Ordinarily, the best solution would be to deny access to the 
15 primary executable file and its associated executabies in its entirety, but this 
is not usually possible. In order to Initially run the application, the filesystem 
4304 must grant unrestricted access to some portions of the primary 
executable. In order to prevent piracy, the filesystem 4304 can grant access 
selectively to just those portions that are needed. Additionally, the running 
20 application 4302 itself does not usually need to read its own code section, 
but does need to read other sections for purposes such as resource loading. 
Therefore, additional security can be introduced by denying access to the 
code sections 4306 of ASP-served executabies 4307 even to those 
executabies themselves. 

25 

To implement this, modify a network filesystem's 4304 open file dispatch 
routine to detect when a remotely served executable 4307 is being opened. 
When this is detected, the executable file 4307 is examined to determine the 
offset and length of its code section 4306, and this information is stored in a 
30 data structure. On most OSes, executable files contain headers from which 
this Information can be easily read. In the read and write dispatch routines, 
the network filesystem 4304 checks if the request is for a remote executable 
4307, and if so, the offset and length of the code section 4306 of this 
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executable 4307 is read from the data structure in whicfi it was previously 
stored. Then the offset and length of the request are checked to see if they 
intersect the code section 4306 of this executable 4307. If so, the request 
can be denied. 

Anti-Piracy Embodiment #4: Filtering of file accesses based on the surmised 
purpose of the file access, as determined by examining the program stack or 
flags associated with the request. 



y 10 Referring again to Fig. 44, the approach of the fourth embodiment is that 
identical requests from the same process for a remotely served file can be 
distinguished based on the reason the request was issued. For example, on 
a computer with a virtual memory subsystem 4404, the VMS's own code will 
be invoked to page-In code for a process that attempts to execute code in 
15 pages that are not currently present. To do this, the VMS 4404 must issue a 
read request to the filesystem 4405 that handles the process' 4402 
executable file 4408. Since this request Is not for any ulterior purpose, such 
as piracy, and is necessary for the application to execute, the request should 
be granted. If the filesystem 4405 gets the originating process ID for such 
20 requests, the process whose code is being paged in will be known. 
However, this same process ID will also be returned for requests that 
originate as a result of an attempt by the process itself to read its own code 
(perhaps for the purpose of piracy). Many applications have loopholes that 
allow the user to execute a macro, for example, that reads and writes 
25 arbitrary files. If the filesystem 4405 simply filters requests based on process 
IDs, it will mistakenly allow users to pirate remotely served applications, as 
long as they can send the necessary reads and writes from within the remote 
application itself. 

30 However, even if the process IDs are the same for two apparently identical 
requests, there are ways the filesystem 4405 can distinguish them. There are 
two ways to do this in a manner relevant to combating anti-piracy. The way to 
implement the first method is to have the filesystem 4405, upon receiving a 
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read request, check for the presence of the paging I/O flag that is supported 
by several operating systems. If this flag Is not present, then the request did 
not come from the VMS 4404, but from the process itself 4403, and thus the 
request is risky and not apparently necessary for the application to run. If the 
flag is present though, the request almost certainly originated from the VMS 

4404 for the purpose of reading in code to allow the process to execute. The 
request should be allowed. 

Another way to make this same determination is to have the filesystem 4405 
examine the program stack upon receiving a read request. In several 
operating systems, a process will attempt to execute code that resides in a 
virtual page regardless of whether the page is present or not. If the page is 
not present, a page fault occurs, and a structure is placed onto the stack that 
holds information about the processor's current state. Then the VMS 4404 
gets control. The VMS 4404 then calls the read routine of the filesystem 

4405 that handles the process's executable file to read this code into 
memory. The filesystem 4405 now reads backwards up the stack up to a 
certain point, searching for the presence of the structure that is placed on the 
stack as a result of a page fault. If such a structure is found, the execution 
pointer register stored in the structure is examined. If the pointer is a memory 
address within the boundary of the virtual memory page that is being paged 
in, then the filesystem 4405 knows the read request is legitimate. 

Anti-Piracy Embodiment #5 : Filtering of file accesses based on the surmised 
purpose of the file access, as determined by examining a history of previous 
file accesses by the same process. 

Referring again to Fig. 45, if one looks at the series of file requests that are 
typically made as a result of attempting to copy an executable file, as 
opposed to those made in the course of executing that file, one can see 
certain patterns. The copy pattern is usually a sequence of sequentially 
ordered read requests, while the execution pattern tends to jump around a 
lot (as the result of code branches into non-present pages). A filesystem can 
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be enhanced to keep around a history of requests made by specific 
processes on remotely served files. Then, for every subsequent request to 
such a file, the history for the originating process can be examined to check 
for certain patterns. If a file-copy pattern is seen, then the pirate may be 
attempting to steal the file, and the request should be denied. If an execution 
type pattern is seen, then the user is simply trying to run the application, and 
the request should be granted. 

To implement this, a filesystem 4507 tells the operating system, via an 
operating system call, upon initialization, to call it back whenever a new 
process is created. When it is called back, the filesystem 4507 creates a new 
data structure for the process that will store file access histories 4505, 4506. 
Then, in its read-file dispatch routines, the filesystem 4507 determines the 
process ID of the originating process, and examines the process's access 
history 4505, 4506. It only examines entries in that history 4505, 4506 that 
refer to the file currently being requested. It will then run a heuristic algorithm 
that tries to determine if the pattern of accesses more closely resembles an 
attempted file copy than code execution. An effective algorithm is to simply 
see if the past n read requests to this file have been sequential, where n is 
some constant. If so, then the request is denied. If not, then the request is 
granted. In either case, an entry is made to the filesystem's process access 
history 4505, 4506 that records the file name, offset, and length of the 
request made by that process to this file. 

CONCLUSION 

Although the present invention has been described using particular 
illustrative embodiments, it will be understood that many variations in 
construction, arrangement and use are possible within the scope of this 
invention. Other embodiments may use different network protocols, different 
programming techniques, or different heuristics, in each component block of 
the invention. Specific examples of variations include: 



119 



• The proxy used In Proxy Caching could be anywhere in the Internet 
along the network path between a Client and the Server; and 

• Concurrent Requesting and Smart Requesting can be implemented in 
hardware instead of software. 

A number of insubstantial variations are possible in the implementation of 
anti-piracy features of the invention. For example, instead of modifying the 
filesystem proper to provide anti-piracy features, a network proxy component 
can be placed on the client computer to filter network requests made by a 
conventional local network filesystem. These requests generally correspond 
to requests for remote files made to the filesystem by a local process, and the 
type of filtering taught by the present invention can be performed on these 
requests. A filesystem filter component can also be written to implement 
these methods, instead of modifying the filesystem itself. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appreciate that other 
applications may be substituted for those set forth herein without departing 
from the spirit and scope of the present invention. Accordingly, the invention 
should only be limited by the Claims included below. 
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